Preface

These are notes of a series of lectures on sieves, presented during the Special 
Activity in Analytic Number Theory, at the Max-Planck Institute for Mathe- 
matics in Bonn, during the period January-June 2002. The notes were taken 
by Boris Moroz, and it is a pleasure to thank him for writing them up in the 
current form. In addition, thanks are due to the institute for its hospitality 
and financial support, and also to the American Institute of Mathematics, 
where these notes were put into final form. 

Being lecture notes they are not intended to have the formal style of a 
textbook. Nor is there any significant claim to novelty. In particular the 
material in §§3 & 4 owes much to the book of Halberstam and Richert 0. 
In addition to the latter work the reader may find it helpful also to consult 
Greaves' exposition 0. 

1 Introduction 

Here are some typical questions which can be expressed as sieve problems: 

(i) Is every even integer $n > 2$ a sum of two primes? (Goldbach's conjecture);

(ii) Are there infinitely many pairs of primes $(p, q)$ with $q = p + 2$? (the twin primes conjecture);

(iii) Are there arbitrarily long arithmetic progressions consisting only of primes?;

(iv) Are there infinitely many primes of the form $n^2 + 1$ with $n \in \mathbb{N}$?;

(v) Is it true that, for every $n \in \mathbb{N}$, there is a prime $p$ in the range $n^2 < p < (n + 1)^2$?;

(vi) For every $\varepsilon > 0$ is there an integer $N(\varepsilon)$ such that the interval $[N, N + N^{\varepsilon}]$ contains a square-free number, as soon as $N > N(\varepsilon)$?;

(vii) Is it true that, for some $\gamma < 1/4$ and every sufficiently large $x$, there is a pair of natural numbers $(m, n)$ such that $x < m^2 + n^2 < x + x^{\gamma}$?

Exercise. Prove that

\[ \{(m, n) : m, n \in \mathbb{N},\ x < m^2 + n^2 < x + 8x^{1/4}\} \neq \emptyset \]

for all sufficiently large $x$.

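Let $\mathcal{A}$ be a finite subset of $\mathbb{N}$, and $\mathcal{P} \subseteq \mathbb{P}$, where $\mathbb{P}$ denotes the set of all primes. For any positive real $z$, set

\[ S(\mathcal{A}, \mathcal{P}; z) := \#\{n \in \mathcal{A} : p \mid n \Rightarrow p \ge z \ \text{for}\ p \in \mathcal{P}\}. \]

Example 1. Let $\mathcal{A} = \{n \in \mathbb{N} : n \le x\}$ and $\mathcal{P} = \mathbb{P}$. Then

\[ S(\mathcal{A}, \mathcal{P}; z) = \#\{n \in \mathbb{N} : n \le x,\ p \mid n \Rightarrow p \ge z \ \text{for}\ p \in \mathbb{P}\}, \]

so that $S(\mathcal{A}, \mathcal{P}; z) = \pi(x) - \pi(z) + O(1)$ for $\sqrt{x} < z \le x$, where, as usual,

\[ \pi(x) := \#\{p \in \mathbb{P} : p \le x\}. \]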
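The sifting function of Example 1 can be checked by brute force. The following is a minimal sketch, not part of the original notes: the helper names `primes_up_to` and `sift` are hypothetical, and it assumes the convention that $S(\mathcal{A}, \mathcal{P}; z)$ removes integers with a prime factor $p < z$ from $\mathcal{P}$. For $\sqrt{x} < z \le x$ the survivors are the integer 1 together with the primes in $[z, x]$.

```python
def primes_up_to(n):
    """Return the list of primes <= n using the sieve of Eratosthenes."""
    if n < 2:
        return []
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, int(n ** 0.5) + 1):
        if is_prime[i]:
            for j in range(i * i, n + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def sift(A, P, z):
    """Count n in A having no prime factor p in P with p < z."""
    small = [p for p in P if p < z]
    return sum(1 for n in A if all(n % p for p in small))


x, z = 1000, 40                      # sqrt(1000) < 40 <= 1000
primes = primes_up_to(x)
count = sift(range(1, x + 1), primes, z)
# Survivors: n = 1 together with the primes in [z, x].
expected = 1 + sum(1 for p in primes if p >= z)
print(count, expected)
```

The extra 1 is the bounded discrepancy between the exact count and $\pi(x) - \pi(z)$.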

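Example 2. Let $\mathcal{A} = \{n(2N - n) : n \in \mathbb{N},\ 2 \le n \le 2N - 2\}$ and $\mathcal{P} = \mathbb{P}$. Then

\[ S(\mathcal{A}, \mathcal{P}; \sqrt{2N}) = \#\{p \in \mathbb{P} : 2N - p \in \mathbb{P},\ \sqrt{2N} \le p,\, 2N - p \le 2N - 2\}. \]

This relates to Goldbach's problem.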
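Example 3. Let

\[ \mathcal{A} = \{n : n \le x\}, \qquad \mathcal{P} = \{p \in \mathbb{P} : p \equiv 3 \pmod 4\}. \]

Then

\[ S(\mathcal{A}, \mathcal{P}; x) = \#\{n \in \mathbb{N} : n \le x,\ p \nmid n \ \text{for}\ p \in \mathbb{P},\ p \equiv 3 \pmod 4\}, \]

so that

\[ S(\mathcal{A}, \mathcal{P}; x) = \#\{n \in \mathbb{N} : n \le x,\ n = l^2 + m^2 \ \text{with}\ l, m \in \mathbb{N},\ \text{h.c.f.}(l, m) = 1\}. \]

We can therefore detect sums of two squares.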

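Let now $\Pi := \Pi(\mathcal{P}, z) := \prod_{p < z,\ p \in \mathcal{P}} p$, and let $\mathcal{A}_d := \{n \in \mathbb{N} : nd \in \mathcal{A}\}$. It follows that

\[ S(\mathcal{A}, \mathcal{P}; z) = \sum_{n \in \mathcal{A},\ (n, \Pi) = 1} 1 = \sum_{n \in \mathcal{A}} \sum_{d \mid (n, \Pi)} \mu(d) = \sum_{d \mid \Pi} \mu(d)\, \#\mathcal{A}_d. \tag{1.1} \]

Main assumption. Suppose that

\[ \#\mathcal{A}_d = \frac{\omega(d)}{d} X + R_d, \tag{1.2} \]

with an absolutely multiplicative $\omega(d)$, satisfying the conditions $\omega(d) \ge 0$ for $d \in \mathbb{N}$ and $\omega(p) = 0$ for $p \in \mathbb{P} \setminus \mathcal{P}$.

In particular, $\#\mathcal{A} = X + R_1$. We think of the remainder term $R_d$ as being small compared to the main term $\frac{\omega(d)}{d} X$.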

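In the notation of Example 2, one may write

\[ \#\mathcal{A}_d = \#\{n(2N - n) : n \in \mathbb{N},\ 2 \le n \le 2N - 2,\ d \mid n(2N - n)\} \]
\[ = \sum_{\substack{m \,(\mathrm{mod}\ d) \\ d \mid m(2N - m)}} \#\{n : 2 \le n \le 2N - 2,\ n \equiv m \pmod d\} \]
\[ = \sum_{\substack{m \,(\mathrm{mod}\ d) \\ d \mid m(2N - m)}} \Big(\frac{2N}{d} + r(d, m)\Big), \]

where $r(d, m) \ll 1$. Thus if $\omega(d) = \#\{m \ (\mathrm{mod}\ d) : d \mid m(2N - m)\}$, then

\[ \#\mathcal{A}_d = \frac{\omega(d)}{d} X + R_d \]

with $X = 2N$ and so, for Example 2, we have

\[ R_d = \sum_{\substack{m \,(\mathrm{mod}\ d) \\ d \mid m(2N - m)}} r(d, m) \ll \omega(d). \]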
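The function $\omega(d)$ of Example 2 can be tabulated directly. The sketch below is an illustration only (the helper names `omega` and `count_A_d` are hypothetical, and $N = 105$ is an arbitrary choice): it counts the residues $m$ modulo $d$ with $d \mid m(2N - m)$ and compares $\#\mathcal{A}_d$ with $\omega(d)\, 2N/d$.

```python
def omega(d, N):
    """Number of residues m mod d with d | m(2N - m)."""
    return sum(1 for m in range(d) if (m * (2 * N - m)) % d == 0)


def count_A_d(d, N):
    """Number of n with 2 <= n <= 2N - 2 and d | n(2N - n)."""
    return sum(1 for n in range(2, 2 * N - 1) if (n * (2 * N - n)) % d == 0)


N = 105                                   # 2N = 210 = 2 * 3 * 5 * 7
for d in (1, 2, 3, 11, 33, 143):
    w = omega(d, N)
    remainder = count_A_d(d, N) - w * 2 * N / d
    print(d, w, round(remainder, 3))
```

For a prime $p$ one finds $\omega(p) = 1$ when $p \mid 2N$ and $\omega(p) = 2$ otherwise, and the values at square-free $d$ multiply, as the Chinese Remainder Theorem predicts.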



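In general it follows from (1.1) and (1.2) that

\[ S(\mathcal{A}, \mathcal{P}; z) = X \sum_{d \mid \Pi(\mathcal{P}, z)} \frac{\mu(d)\omega(d)}{d} + \sum_{d \mid \Pi(\mathcal{P}, z)} \mu(d) R_d. \tag{1.3} \]

Writing $W(z; \omega) = \prod_{p < z,\ p \in \mathcal{P}} \big(1 - \frac{\omega(p)}{p}\big)$ we deduce the following from (1.3).

Corollary 1.1 We have

\[ |S(\mathcal{A}, \mathcal{P}; z) - X W(z; \omega)| \le \sum_{d \mid \Pi(\mathcal{P}, z)} |R_d|. \]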
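For $\mathcal{A} = \{n \le x\}$ and $\mathcal{P} = \mathbb{P}$ the identity (1.1) reads $S = \sum_{d \mid \Pi} \mu(d) [x/d]$, and the bound of Corollary 1.1 becomes $|S - xW| \le 2^{\pi(z)}$. A small sketch that checks both numerically follows; the function names are hypothetical and the parameters $x = 1000$, $z = 20$ are arbitrary.

```python
from itertools import combinations
from math import prod


def primes_below(z):
    """Primes p < z by trial division (adequate for small z)."""
    return [p for p in range(2, z)
            if all(p % q for q in range(2, int(p ** 0.5) + 1))]


def legendre_count(x, z):
    """Evaluate sum_{d | Pi(z)} mu(d) * floor(x / d), as in (1.1)."""
    ps = primes_below(z)
    total = 0
    for r in range(len(ps) + 1):
        for combo in combinations(ps, r):
            d = prod(combo)               # square-free divisor of Pi(z)
            total += (-1) ** r * (x // d)
    return total


def direct_count(x, z):
    """Count n <= x with no prime factor below z."""
    ps = primes_below(z)
    return sum(1 for n in range(1, x + 1) if all(n % p for p in ps))


x, z = 1000, 20
S = legendre_count(x, z)
W = prod(1 - 1 / p for p in primes_below(z))
print(S, direct_count(x, z), x * W)
```

The number of divisors $d$ doubles with every prime below $z$, which is the reason the remainder sum in Corollary 1.1 becomes unmanageable for large $z$.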
This corollary is known as the Sieve of Eratosthenes-Legendre.

Lemma 1.1 (The Mertens Formula). Let $V(z) := \prod_{p < z,\ p \in \mathbb{P}} (1 - 1/p)$. One has

\[ V(z) = \frac{e^{-\gamma}}{\log z} + O\Big(\frac{1}{(\log z)^2}\Big). \tag{1.4} \]

Proof. See, for instance, Prachar (pp. 80-81).
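The Mertens formula (1.4) is easy to observe numerically. The sketch below is illustrative only (the function name is hypothetical, and the value of Euler's constant is hard-coded): it computes $V(z)$ and prints the ratio $V(z) e^{\gamma} \log z$, which should approach 1.

```python
import math

EULER_GAMMA = 0.5772156649015329


def mertens_product(z):
    """V(z) = prod_{p < z} (1 - 1/p) for an integer z >= 2."""
    is_prime = [True] * z                 # indices 0 .. z-1
    is_prime[0] = is_prime[1] = False
    for i in range(2, int(z ** 0.5) + 1):
        if i < z and is_prime[i]:
            for j in range(i * i, z, i):
                is_prime[j] = False
    v = 1.0
    for p in range(2, z):
        if is_prime[p]:
            v *= 1.0 - 1.0 / p
    return v


for z in (10 ** 2, 10 ** 3, 10 ** 4, 10 ** 5):
    ratio = mertens_product(z) * math.exp(EULER_GAMMA) * math.log(z)
    print(z, ratio)
```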
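Example 1 (continued). We have

\[ \mathcal{A}_d = \{n \in \mathbb{N} : nd \le x\}, \qquad \#\mathcal{A}_d = \Big[\frac{x}{d}\Big] = \frac{x}{d} + O(1), \]

and $\mathcal{P} = \mathbb{P}$, so that we may take $X = x$, $\omega(d) = 1$, $|R_d| \le 1$. Therefore it follows from Corollary 1.1 and Lemma 1.1 that

\[ S(\mathcal{A}, \mathcal{P}; z) = x \frac{e^{-\gamma}}{\log z} \{1 + O(1/\log z)\} + O(2^{\pi(z)}), \]

since

\[ \sum_{d \mid \Pi(\mathcal{P}, z)} |R_d| \le \sum_{d \mid \Pi(\mathcal{P}, z)} 1 \le 2^{\pi(z)}. \]

Thus

\[ S(\mathcal{A}, \mathcal{P}; z) \sim x \frac{e^{-\gamma}}{\log z} \]

as soon as $z \le \log x$, since

\[ 2^{\pi(z)} = \exp\Big(\frac{z}{\log z} \{1 + O(1/\log z)\} \log 2\Big) \]
\[ \le \exp\Big(\frac{\log x}{\log\log x} \{1 + O(1/\log\log x)\} \log 2\Big) \]
\[ \le \exp\big(\tfrac{1}{2} \log x\big) \]
\[ = o(x/\log z) \]

for $z \le \log x$, $x \to \infty$.

Corollary 1.2 For $z \le \log x$ we have

\[ \#\{n \in \mathbb{N} : n \le x,\ p \mid n \Rightarrow p \ge z \ \text{for}\ p \in \mathbb{P}\} \sim \frac{e^{-\gamma} x}{\log z}. \]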
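We would like to extend the admissible range of $z$ in such a result. The proof of Corollary 1.2 uses the fact that

\[ \sum_{d \mid (n, \Pi(\mathcal{P}, z))} \mu(d) = \begin{cases} 1, & (n, \Pi(\mathcal{P}, z)) = 1, \\ 0, & (n, \Pi(\mathcal{P}, z)) > 1. \end{cases} \]

However the sum here is over an uncomfortably large range. We therefore replace the above equality with two inequalities, and encounter the following.

Sieve problem. Find two real-valued functions $\mu^-(d)$ and $\mu^+(d)$, of suitably small support, satisfying the conditions

\[ \sum_{d \mid n,\ d \mid \Pi(\mathcal{P}, z)} \mu^-(d) \le \begin{cases} 1, & (n, \Pi(\mathcal{P}, z)) = 1, \\ 0, & (n, \Pi(\mathcal{P}, z)) > 1, \end{cases} \tag{1.5} \]

and

\[ \sum_{d \mid n,\ d \mid \Pi(\mathcal{P}, z)} \mu^+(d) \ge \begin{cases} 1, & (n, \Pi(\mathcal{P}, z)) = 1, \\ 0, & (n, \Pi(\mathcal{P}, z)) > 1. \end{cases} \tag{1.6} \]

It follows from (1.6), for example, that

\[ S(\mathcal{A}, \mathcal{P}; z) = \sum_{n \in \mathcal{A},\ (n, \Pi(\mathcal{P}, z)) = 1} 1 \le \sum_{n \in \mathcal{A}} \sum_{d \mid n,\ d \mid \Pi(\mathcal{P}, z)} \mu^+(d) = \sum_{d \mid \Pi(\mathcal{P}, z)} \mu^+(d)\, \#\mathcal{A}_d \]
\[ = X \sum_{d \mid \Pi(\mathcal{P}, z)} \frac{\mu^+(d)\omega(d)}{d} + \sum_{d \mid \Pi(\mathcal{P}, z)} \mu^+(d) R_d. \tag{1.7} \]

Hence

\[ S(\mathcal{A}, \mathcal{P}; z) \le X \sum_{d \mid \Pi(\mathcal{P}, z)} \frac{\mu^+(d)\omega(d)}{d} + \sum_{d \mid \Pi(\mathcal{P}, z)} |\mu^+(d)|\,|R_d|. \tag{1.8} \]

To minimise the right-hand side of (1.8), subject to the condition (1.6), is in general a challenging unsolved problem.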

Some achievements of sieve methods. 

(i) We have

\[ \#\{(p, p') : p, p' \in \mathbb{P},\ p + p' = 2n\} \ll \frac{\sigma(n)}{n} \frac{n}{(\log n)^2}, \]

where $\sigma(n)$ is the sum of divisors function. This is conjectured to be best possible, up to the value of the implied constant.

(ii) We have (Chen)

\[ \#\{(p, p') : p \in \mathbb{P},\ p' \in P_2,\ p + p' = 2n\} \gg \frac{n}{(\log n)^2}, \]

where $P_2$ is the set of positive integers which are either prime or a product of two primes.

(iii) We also have (Chen)

\[ \#\{p \in \mathbb{P} : p \le x,\ p + 2 \in P_2\} \gg \frac{x}{(\log x)^2}. \]

(iv) We have (Iwaniec)

\[ \#\{n \le x : n^2 + 1 \in P_2\} \gg \frac{x}{\log x}. \]

(v) We have (Heath-Brown)

\[ \#\{n \equiv l \ (\mathrm{mod}\ k) : n \in P_2,\ n \le k^2\} \gg \frac{k^2}{\varphi(k) \log k} \]

for large enough $k$, if h.c.f.$(l, k) = 1$.

2 Selberg's sieve 

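To satisfy (1.6), let

\[ \mu^+(d) = \sum_{[d_1, d_2] = d} \lambda_{d_1} \lambda_{d_2} \tag{2.1} \]

with $\lambda_1 = 1$ and $\lambda_d \in \mathbb{R}$. Clearly,

\[ \sum_{d \mid n,\ d \mid \Pi(\mathcal{P}, z)} \mu^+(d) = \sum_{[d_1, d_2] \mid (n, \Pi(\mathcal{P}, z))} \lambda_{d_1} \lambda_{d_2} = \sum_{d_1, d_2 \mid (n, \Pi(\mathcal{P}, z))} \lambda_{d_1} \lambda_{d_2} = \Big(\sum_{d \mid (n, \Pi(\mathcal{P}, z))} \lambda_d\Big)^2 \ge 0. \]

Moreover, if $(n, \Pi(\mathcal{P}, z)) = 1$ then

\[ \sum_{d \mid n,\ d \mid \Pi(\mathcal{P}, z)} \mu^+(d) = \mu^+(1) = \lambda_1^2 = 1. \]

Hence $\mu^+(d)$ satisfies (1.6).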

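We shall minimise the main term on the right-hand side of (1.8) in the class of functions given by (2.1). Let

\[ S_0 := \sum_{d \mid \Pi(\mathcal{P}, z)} \frac{\mu^+(d)\omega(d)}{d}. \]

If $\lambda_d = 0$ for $d \ge \xi$, then $\mu^+(d) = 0$ for $d \ge y$, where $y = \xi^2$. Now,

\[ S_0 = \sum_{[d_1, d_2] \mid \Pi(\mathcal{P}, z)} \lambda_{d_1} \lambda_{d_2} \frac{\omega([d_1, d_2])}{[d_1, d_2]}. \]

Suppose that $\lambda_d = 0$ for $d \ge \xi$, then

\[ S_0 = \sideset{}{^*}\sum_{\substack{d_1, d_2 \mid \Pi \\ d_1, d_2 < \xi}} \frac{\omega(d_1)\lambda_{d_1}}{d_1} \frac{\omega(d_2)\lambda_{d_2}}{d_2} \frac{(d_1, d_2)}{\omega((d_1, d_2))}, \]

where $\sum^*$ omits the terms for which $\omega(d_1)\omega(d_2) = 0$.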
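We now introduce the assumption

\[ 0 \le \omega(p) < p \quad \text{for } p \in \mathcal{P}, \tag{$\Omega_1$} \]

which we shall refer to in future merely as Condition $(\Omega_1)$. Then if $\mu(k)\omega(k) \neq 0$ we find that

\[ \sum_{d \mid k} \mu\Big(\frac{k}{d}\Big) \frac{d}{\omega(d)} = \frac{k}{\omega(k)} \prod_{p \mid k} \Big(1 - \frac{\omega(p)}{p}\Big) > 0. \]

Thus we may define a non-negative multiplicative function $g(k)$ by

\[ g(k) := \frac{\omega(k)}{k} \prod_{p \mid k} \Big(1 - \frac{\omega(p)}{p}\Big)^{-1}, \]

so that $1/g(k)$ is the sum displayed above whenever $\mu(k)\omega(k) \neq 0$. The Möbius inversion formula then shows that

\[ \frac{d}{\omega(d)} = \sum_{k \mid d} \frac{1}{g(k)} \]

for $\mu(d)\omega(d) \neq 0$. Under condition $(\Omega_1)$, it now follows that

\[ S_0 = \sideset{}{^*}\sum_{d_1, d_2 \mid \Pi} \frac{\omega(d_1)\lambda_{d_1}}{d_1} \frac{\omega(d_2)\lambda_{d_2}}{d_2} \sum_{l \mid (d_1, d_2)} \frac{1}{g(l)} = \sideset{}{^*}\sum_{\substack{l \mid \Pi(\mathcal{P}, z) \\ l < \xi}} \frac{y_l^2}{g(l)} \]

with

\[ y_l := \sum_{\substack{d \mid \Pi(\mathcal{P}, z) \\ l \mid d,\ d < \xi}} \frac{\omega(d)\lambda_d}{d}. \]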

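Lemma 2.1 We have

\[ \frac{\omega(d)}{d} \lambda_d = \mu(d) \sum_{\substack{l \mid \Pi,\ d \mid l \\ l < \xi}} \mu(l) y_l \tag{2.2} \]

if $d < \xi$ and $d \mid \Pi$.

Proof. When $d \mid \Pi$ we have

\[ \sum_{\substack{l \mid \Pi,\ l < \xi \\ d \mid l}} \mu(l) y_l = \sum_{\substack{l \mid \Pi,\ l < \xi \\ d \mid l}} \mu(l) \sum_{\substack{\delta \mid \Pi,\ \delta < \xi \\ l \mid \delta}} \frac{\omega(\delta)\lambda_\delta}{\delta} \]
\[ = \sum_{\substack{\delta \mid \Pi,\ \delta < \xi \\ d \mid \delta}} \frac{\omega(\delta)\lambda_\delta}{\delta} \sum_{d \mid l \mid \delta} \mu(l) \]
\[ = \mu(d) \sum_{\substack{\delta \mid \Pi,\ \delta < \xi \\ d \mid \delta}} \frac{\omega(\delta)\lambda_\delta}{\delta} \sum_{m \mid \delta/d} \mu(m) \]
\[ = \frac{\mu(d)\omega(d)\lambda_d}{d}, \]

as claimed. Here we have used the fact that $\Pi$ is square-free, so that $m$ and $d$ are coprime for $md \mid \Pi$.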

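Since $\lambda_1 = 1$, it follows from (2.2) that

\[ 1 = \sum_{l \mid \Pi(\mathcal{P}, z),\ l < \xi} \mu(l) y_l = \sum_{l \mid \Pi(\mathcal{P}, z),\ l < \xi} \mu(l) g(l)^{1/2} \cdot y_l\, g(l)^{-1/2}, \]

and therefore

\[ 1 \le \Big(\sum_{l \mid \Pi(\mathcal{P}, z),\ l < \xi} \mu(l)^2 g(l)\Big) \Big(\sum_{l \mid \Pi(\mathcal{P}, z),\ l < \xi} \frac{y_l^2}{g(l)}\Big) = G(\xi, z)\, S_0 \]

by Cauchy's inequality, where

\[ G(\xi, z) := \sum_{l \mid \Pi(\mathcal{P}, z),\ l < \xi} g(l). \]

Thus $S_0 \ge G(\xi, z)^{-1}$, and $S_0 = G(\xi, z)^{-1}$ if and only if there is a constant $c$ such that

\[ y_l = c\, \mu(l) g(l) \]

for every $l$, this being the condition for equality in Cauchy's inequality. For the optimal values $y_l = c\, \mu(l) g(l)$ Lemma 2.1 yields

\[ 1 = \frac{\omega(1)}{1} \lambda_1 = \mu(1) \sum_{l \mid \Pi,\ l < \xi} \mu(l) y_l = c \sum_{l \mid \Pi,\ l < \xi} \mu(l)^2 g(l) = c\, G(\xi, z), \]

whence

\[ c = \frac{1}{G(\xi, z)}. \]

Thus $S_0 = G(\xi, z)^{-1}$ providing that $y_l = \mu(l) g(l) G(\xi, z)^{-1}$, in which case (2.2) produces

\[ \lambda_d = \frac{\mu(d)\, d}{\omega(d)\, G(\xi, z)} \sum_{\substack{l \mid \Pi,\ d \mid l \\ l < \xi}} \mu(l)^2 g(l) = \frac{\mu(d)\, d g(d)}{\omega(d)\, G(\xi, z)} \sum_{\substack{dj \mid \Pi(\mathcal{P}, z) \\ j < \xi/d}} g(j). \]

On recalling that

\[ \frac{d g(d)}{\omega(d)} = \prod_{p \mid d} \Big(1 - \frac{\omega(p)}{p}\Big)^{-1}, \]

one obtains the minimising condition

\[ \lambda_d = \mu(d) \prod_{p \mid d} \Big(1 - \frac{\omega(p)}{p}\Big)^{-1} \frac{G_d(\xi/d, z)}{G(\xi, z)} \quad \text{for } d < \xi, \tag{2.3} \]

with

\[ G_d(\xi, z) := \sum_{\substack{dj \mid \Pi(\mathcal{P}, z) \\ j < \xi}} g(j); \qquad G_1(\xi, z) = G(\xi, z). \]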

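The choice of variables (2.3), under the assumption

\[ \lambda_d = 0 \quad \text{for } d \ge \xi, \tag{2.4} \]

turns (1.8) into the inequality

\[ S(\mathcal{A}, \mathcal{P}; z) \le X\, G(\xi, z)^{-1} + \sum_{d \mid \Pi(\mathcal{P}, z),\ d < y} |\mu^+(d)|\,|R_d|, \tag{2.5} \]

since $\mu^+(d) = 0$ for $d \ge y$ in view of (2.4).

In order to bound $\lambda_d$ we will require the following result.

Lemma 2.2 Let $d \mid \Pi(\mathcal{P}, z)$. Then

\[ G(\xi, z) \ge G_d(\xi/d, z) \prod_{p \mid d} \Big(1 - \frac{\omega(p)}{p}\Big)^{-1}. \tag{2.6} \]

Proof. Let $d \mid \Pi(\mathcal{P}, z)$. Then

\[ G(\xi, z) = \sum_{l \mid d}\ \sum_{\substack{m \mid \Pi,\ m < \xi \\ (m, d) = l}} g(m) \]
\[ = \sum_{l \mid d}\ \sum_{\substack{lh \mid \Pi,\ lh < \xi \\ (h, d/l) = (h, l) = 1}} g(lh) \]
\[ = \sum_{l \mid d} g(l) \sum_{\substack{h \mid \Pi,\ h < \xi/l \\ (h, d/l) = (h, l) = 1}} g(h) \]
\[ = \sum_{l \mid d} g(l) \sum_{\substack{h < \xi/l \\ dh \mid \Pi}} g(h) \]
\[ \ge \sum_{l \mid d} g(l) \sum_{\substack{h < \xi/d \\ dh \mid \Pi}} g(h) \]
\[ = \sum_{l \mid d} g(l)\, G_d(\xi/d, z), \]

since $\xi/d \le \xi/l$ and $g(h) \ge 0$. The inequality (2.6) follows now from the identity

\[ \sum_{l \mid d} g(l) = \prod_{p \mid d} (1 + g(p)) = \prod_{p \mid d} \Big(1 + \frac{\omega(p)}{p} \Big(1 - \frac{\omega(p)}{p}\Big)^{-1}\Big) = \prod_{p \mid d} \Big(1 - \frac{\omega(p)}{p}\Big)^{-1}, \]

in view of the condition $(\Omega_1)$.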



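The "Fundamental Theorem for Selberg's sieve". Assume $(\Omega_1)$. Then

\[ S(\mathcal{A}, \mathcal{P}; z) \le \frac{X}{G(\xi, z)} + \sum_{d < y,\ d \mid \Pi(\mathcal{P}, z)} 3^{\nu(d)} |R_d| \]

for $\xi > 1$, where

\[ \xi := y^{1/2}, \qquad \nu(d) := \sum_{p \mid d,\ p \in \mathbb{P}} 1, \qquad G(\xi, z) := \sum_{l \mid \Pi(\mathcal{P}, z),\ l < \xi} g(l) \]

with

\[ g(d) = \prod_{p \mid d} \frac{\omega(p)}{p} \Big(1 - \frac{\omega(p)}{p}\Big)^{-1}. \]

Proof. It follows from the relations (2.3) and (2.6) that $|\lambda_d| \le 1$. Therefore

\[ |\mu^+(d)| \le \sum_{[d_1, d_2] = d} |\lambda_{d_1} \lambda_{d_2}| \le \sum_{[d_1, d_2] = d} 1 = \sum_{k = 0}^{\nu(d)} \binom{\nu(d)}{k} 2^k = 3^{\nu(d)} \]

for square-free $d$. Consequently, the assertion of the theorem follows from (2.5).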


3 Some applications 

We prove two corollaries of the Fundamental Theorem. 
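Corollary 3.1 Let $\mathcal{P} = \{p \in \mathbb{P} : p \nmid k\}$ and let

\[ \omega(p) = \begin{cases} 0, & p \mid k, \\ 1, & p \nmid k. \end{cases} \]

Then

\[ S(\mathcal{A}, \mathcal{P}; z) \le \frac{k}{\varphi(k)} \frac{X}{\log z} + \sum_{\substack{d < z^2,\ d \mid \Pi(\mathcal{P}, z) \\ (d, k) = 1}} 3^{\nu(d)} |R_d|. \tag{3.1} \]

Proof. When $l \mid \Pi(\mathcal{P}, z)$ we have

\[ g(l) = \frac{1}{l} \prod_{p \mid l} \Big(1 - \frac{1}{p}\Big)^{-1} = \frac{1}{\varphi(l)}, \]

so that

\[ G(z, z) = \sum_{l < z,\ l \mid \Pi(\mathcal{P}, z)} \frac{1}{\varphi(l)} = \sum_{l < z,\ (l, k) = 1} \frac{\mu(l)^2}{\varphi(l)} =: H_k(z). \]

Let $\kappa(n)$ be the largest square-free divisor of $n$. One then obtains

\[ H_1(z) = \sum_{l < z} \frac{\mu(l)^2}{l} \prod_{p \mid l} \Big(1 - \frac{1}{p}\Big)^{-1} = \sum_{l = p_1 \cdots p_r < z}\ \prod_{i = 1}^{r} (p_i - 1)^{-1} = \sum_{p_1 \cdots p_r < z}\ \prod_{i = 1}^{r} \sum_{e_i \ge 1} p_i^{-e_i} = \sum_{\kappa(n) < z} \frac{1}{n} \ge \sum_{n < z} \frac{1}{n} \ge \log z. \]

On the other hand, for any square-free $k$ we have

\[ H_1(z) = \sum_{n < z} \frac{\mu(n)^2}{\varphi(n)} = \sum_{l \mid k}\ \sum_{\substack{hl < z \\ (h, k/l) = 1}} \frac{\mu(hl)^2}{\varphi(hl)} = \sum_{l \mid k} \frac{\mu(l)^2}{\varphi(l)} \sum_{\substack{h < z/l \\ (h, k) = 1}} \frac{\mu(h)^2}{\varphi(h)} \]
\[ \le H_k(z) \sum_{l \mid k} \frac{\mu(l)^2}{\varphi(l)} = H_k(z) \prod_{p \mid k} \Big(1 + \frac{1}{p - 1}\Big) = \frac{k}{\varphi(k)} H_k(z). \]

Combining the last two estimates, one obtains

\[ H_k(z) \ge \frac{\varphi(k)}{k} \log z. \tag{3.2} \]

Thus

\[ G(z, z) \ge \frac{\varphi(k)}{k} \log z, \]

so that Corollary 3.1 follows from the Fundamental Theorem with $\xi = z$.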



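Corollary 3.2 Let $\mathcal{P} = \{p \in \mathbb{P} : p \nmid k\}$ and let

\[ \omega(p) = \begin{cases} 0, & p \mid k, \\ \dfrac{p}{p - 1}, & p \nmid k. \end{cases} \]

Then

\[ S(\mathcal{A}, \mathcal{P}; z) \le C_2 \Big(\prod_{p \mid k,\ p \neq 2} \frac{p - 1}{p - 2}\Big) \frac{X}{\log z} \Big\{1 + O\Big(\frac{1}{\log z}\Big)\Big\} + \sum_{\substack{d < z^2,\ d \mid \Pi(\mathcal{P}, z) \\ (d, k) = 1}} 3^{\nu(d)} |R_d| \]

for even $k$, where

\[ C_2 = 2 \prod_{p \ge 3} \Big(1 - \frac{1}{(p - 1)^2}\Big) \]

is the "twin prime constant".

Proof. We have $g(p) = 0$ if $p \mid k$, and

\[ g(p) = \frac{\omega(p)}{p} \Big(1 - \frac{\omega(p)}{p}\Big)^{-1} = \frac{1}{p - 1} \Big(1 - \frac{1}{p - 1}\Big)^{-1} = \frac{1}{p - 2} = \frac{1}{\varphi(p)} (1 + g(p)) \]

otherwise. Therefore

\[ g(d) = \frac{1}{\varphi(d)} \sum_{l \mid d} g(l) \]

as soon as $(d, k) = 1$ and $\mu(d)^2 = 1$. Moreover, $g(d) = 0$ when $(d, k) \neq 1$. Thus

\[ G(z, z) = \sum_{d \mid \Pi(\mathcal{P}, z),\ d < z} g(d) = \sum_{d < z,\ (d, k) = 1} \frac{\mu(d)^2}{\varphi(d)} \sum_{l \mid d} g(l) = \sum_{\substack{(ml, k) = 1 \\ ml < z}} \frac{\mu(ml)^2 g(l)}{\varphi(ml)} \]
\[ = \sum_{l < z,\ (l, k) = 1} \frac{\mu(l)^2 g(l)}{\varphi(l)} \sum_{(m, lk) = 1,\ m < z/l} \frac{\mu(m)^2}{\varphi(m)} = \sum_{l < z,\ (l, k) = 1} \frac{\mu(l)^2 g(l)}{\varphi(l)} H_{lk}(z/l). \]

In view of (3.2),

\[ G(z, z) \ge \sum_{l < z,\ (l, k) = 1} \frac{\mu(l)^2 g(l)}{\varphi(l)} \frac{\varphi(lk)}{lk} \log\frac{z}{l} = \frac{\varphi(k)}{k} \sum_{l < z,\ (l, k) = 1} \frac{\mu(l)^2 g(l)}{l} \log\frac{z}{l} \]
\[ \ge \frac{\varphi(k)}{k} \sum_{\substack{l = 1 \\ (l, k) = 1}}^{\infty} \frac{\mu(l)^2 g(l)}{l} \log\frac{z}{l} = \frac{\varphi(k)}{k} \{\log z + O(1)\} \prod_{p \nmid k} \Big(1 + \frac{1}{p(p - 2)}\Big), \]

so that

\[ \frac{1}{G(z, z)} \le \frac{1}{\log z} \Big(1 + O\Big(\frac{1}{\log z}\Big)\Big) \cdot 2 \Big(\prod_{p \mid k,\ p \neq 2} \frac{p - 1}{p - 2}\Big) \prod_{p \ge 3} \Big(1 - \frac{1}{(p - 1)^2}\Big), \]

since

\[ 1 + \frac{1}{p(p - 2)} = \frac{(p - 1)^2}{p(p - 2)} \]

and

\[ \frac{p}{p - 1} = \frac{p(p - 2)}{(p - 1)^2} \cdot \frac{p - 1}{p - 2} \]

for $p \neq 2$. Corollary 3.2 now follows from the Fundamental Theorem.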
Our applications will require one further result. 

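Lemma 3.1 Let $h \in \mathbb{N}$ and set

\[ S_1 = \sum_{d < x} \mu(d)^2 h^{\nu(d)}, \qquad S_2 = \sum_{d < x} \frac{\mu(d)^2 h^{\nu(d)}}{d}. \]

We then have

\[ S_1 \le x (1 + \log x)^h \quad \text{and} \quad S_2 \le (1 + \log x)^h. \]

Proof. Clearly,

\[ S_1 \le \sum_{d < x} \mu(d)^2 h^{\nu(d)} \frac{x}{d} = x S_2. \]

Moreover,

\[ S_2 = \sum_{d < x} \frac{\mu(d)^2}{d} \sum_{\substack{(d_1, \ldots, d_h) \\ d_1 \cdots d_h = d}} 1 \le \sum_{d_1, \ldots, d_h < x} \frac{1}{d_1 \cdots d_h} = \Big(\sum_{d < x} \frac{1}{d}\Big)^h \le (1 + \log x)^h, \]

as asserted. Here we have used the fact that if $\mu(d)^2 = 1$ and $d = p_1 \cdots p_r$, then

\[ h^r = \#\{(d_1, \ldots, d_h) \in \mathbb{N}^h : d_1 \cdots d_h = d\}. \]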
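The two bounds of Lemma 3.1 are simple to test numerically. The following sketch is illustrative (the helper names are hypothetical and $x = 2000$ is arbitrary); it tabulates $\nu(d)$ and the square-free indicator with a sieve and evaluates $S_1$ and $S_2$ for the values $h = 3$ and $h = 9$ that are used later in this section.

```python
import math


def nu_and_squarefree(limit):
    """nu[n] = number of distinct prime factors, sqfree[n] = (mu(n)^2 == 1)."""
    nu = [0] * (limit + 1)
    sqfree = [True] * (limit + 1)
    for p in range(2, limit + 1):
        if nu[p] == 0:                    # no smaller prime divides p, so p is prime
            for m in range(p, limit + 1, p):
                nu[m] += 1
            for m in range(p * p, limit + 1, p * p):
                sqfree[m] = False
    return nu, sqfree


def lemma_sums(x, h):
    """Return (S_1, S_2) of Lemma 3.1, summing over square-free d < x."""
    nu, sqfree = nu_and_squarefree(x)
    s1 = sum(h ** nu[d] for d in range(1, x) if sqfree[d])
    s2 = sum(h ** nu[d] / d for d in range(1, x) if sqfree[d])
    return s1, s2


x = 2000
for h in (3, 9):
    s1, s2 = lemma_sums(x, h)
    print(h, s1, x * (1 + math.log(x)) ** h, s2, (1 + math.log(x)) ** h)
```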

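First application. Let $\mathcal{A} = \{n : x < n \le x + y\}$ and $\mathcal{P} = \mathbb{P}$, so that

\[ \#\mathcal{A}_d = \frac{y}{d} + O(1), \qquad \omega(d) = 1 \quad \text{for } d \in \mathbb{N}. \]

Then

\[ S(\mathcal{A}, \mathcal{P}; z) \le \frac{y}{\log z} + O\Big(\sum_{d < z^2} 3^{\nu(d)} \mu(d)^2\Big) \]

by Corollary 3.1. In view of Lemma 3.1, this gives

\[ S(\mathcal{A}, \mathcal{P}; z) \le \frac{y}{\log z} + O\big(z^2 (1 + \log z)^3\big). \]

On the other hand,

\[ S(\mathcal{A}, \mathcal{P}; z) = \#\{n : x < n \le x + y,\ p \mid n \Rightarrow p \ge z\} \ge \pi(x + y) - \pi(x) - z. \]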

Taking z = (j^^^, it follows that 

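\[ S(\mathcal{A}, \mathcal{P}; z) \le \frac{2y}{\log y} + O\Big(\frac{y \log\log y}{(\log y)^2}\Big). \]

In particular,

\[ \pi(x + y) - \pi(x) \le \frac{2y}{\log y} + O\Big(\frac{y \log\log y}{(\log y)^2}\Big) \quad \text{for } x, y \ge 2. \tag{3.3} \]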
It has been proved (Heath-Brown) that

\[ \pi(x + y) - \pi(x) \sim \frac{y}{\log x} \tag{3.4} \]

for $y \ge x^{7/12}$. In contrast, (3.4) is false (Maier) for $y = (\log x)^A$, for any constant $A$. Thus (3.3) is useful for relatively small $y$. Montgomery and Vaughan have removed the error term above and proved that

\[ \pi(x + y) - \pi(x) \le \frac{2y}{\log y} \quad \text{for } x, y \ge 2. \]
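The Montgomery and Vaughan inequality can be observed on small ranges. The sketch below is only a numerical illustration (the function names and the ranges are arbitrary choices): it reports the largest value of $(\pi(x + y) - \pi(x)) \log y / (2y)$ over integer $x$ in a range, which the inequality predicts to be at most 1.

```python
import math


def prime_count_table(n):
    """prefix[m] = pi(m) for 0 <= m <= n (n >= 1)."""
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, int(n ** 0.5) + 1):
        if is_prime[i]:
            for j in range(i * i, n + 1, i):
                is_prime[j] = False
    prefix, running = [], 0
    for flag in is_prime:
        running += flag
        prefix.append(running)
    return prefix


def worst_ratio(limit, ys):
    """Max of (pi(x+y) - pi(x)) * log(y) / (2y) over 2 <= x <= limit, y in ys."""
    pi = prime_count_table(limit + max(ys))
    worst = 0.0
    for y in ys:
        for x in range(2, limit + 1):
            worst = max(worst, (pi[x + y] - pi[x]) * math.log(y) / (2 * y))
    return worst


print(worst_ratio(5000, [2, 10, 100, 1000]))
```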

It has been conjectured that $\pi(x + y) \le \pi(x) + \pi(y)$ for all $x, y \ge 2$. However Hensley and Richards have proved that this would be incompatible with the $k$-tuples prime conjecture. It is not clear at the moment whether the factor 2 in (3.3) may be replaced by a smaller number. Indeed Erdős apparently believed that the constant may be taken as 1, while Selberg is reputed to have suggested that no constant below 2 is admissible.
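Second application. Suppose that $(l, k) = 1$ and let

\[ \mathcal{A} = \{n \le x : n \equiv l \ (\mathrm{mod}\ k)\}, \]
\[ \pi(x; k, l) := \#\{p \in \mathbb{P} : p \le x,\ p \equiv l \ (\mathrm{mod}\ k)\}, \]
\[ \mathcal{P} = \{p \in \mathbb{P} : p \nmid k\}. \]

Clearly,

\[ \pi(x; k, l) \le S(\mathcal{A}, \mathcal{P}; z) + 1 + \frac{z}{k}. \tag{3.5} \]

Moreover,

\[ \#\mathcal{A}_d = \frac{\omega(d)}{d} \frac{x}{k} + O(1) \]

with

\[ \omega(d) = \begin{cases} 1, & \text{if } (d, k) = 1, \\ 0, & \text{if } (d, k) \neq 1, \end{cases} \]

so that

\[ S(\mathcal{A}, \mathcal{P}; z) \le \frac{k}{\varphi(k)} \frac{x}{k \log z} + \sum_{d < z^2} 3^{\nu(d)} \mu(d)^2 = \frac{x}{\varphi(k) \log z} + O\big(z^2 (\log z)^3\big) \]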

by Corollary 3.1 and Lemma 3.1. Let z = {x/ky/'^(\ogx/k) ^. Assuming 
that X > 4:k, say, the estimate (3.5) yields the following result. 
The Brun-Titchmarsh Theorem. We have

\[ \pi(x; k, l) \le \frac{2x}{\varphi(k) \log(x/k)} \Big\{1 + O\Big(\frac{\log\log(x/k)}{\log(x/k)}\Big)\Big\} \tag{3.6} \]

for $x \ge 4k$.

The Siegel-Walfisz Theorem gives

\[ \pi(x; k, l) \sim \frac{x}{\varphi(k) \log x} \quad \text{for } k \le (\log x)^A \]

for any fixed $A$, so that the constant 2 in the Brun-Titchmarsh Theorem may be replaced by 1 if $x$ is sufficiently large compared with $k$. Moreover Montgomery and Vaughan have proved that

\[ \pi(x; k, l) \le \frac{2x}{\varphi(k) \log(x/k)} \quad \text{for } k < x. \]

The constant 2 in (3.6) is presumably hard to improve, for it is known that if one could replace 2 by $2 - \delta$ with a positive constant $\delta$, then it would follow that there are no "Siegel-Landau zeros".
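Third application. Let

\[ \mathcal{A} = \{2N - p : p \in \mathcal{P},\ 3 \le p \le 2N - 3\}, \qquad \mathcal{P} = \{p \in \mathbb{P} : (p, 2N) = 1\}, \]

so that

\[ \#\mathcal{A}_d = \#\{p \in \mathbb{P} : p \equiv 2N \ (\mathrm{mod}\ d),\ p \nmid 2N,\ p \le 2N - 3\} = \pi(2N; d, 2N) + O(1 + \nu(N)). \]

When h.c.f.$(d, 2N) = 1$ we expect that

\[ \pi(2N; d, 2N) \approx \frac{\mathrm{Li}(2N)}{\varphi(d)} \]

for $N$ large compared with $d$. We therefore take $X = \mathrm{Li}(2N)$ and $\omega(d) = \varphi(d)^{-1} d$. On writing

\[ E(x; k, l) = \pi(x; k, l) - \frac{\mathrm{Li}(x)}{\varphi(k)} \]

it follows that

\[ R_d = E(2N; d, 2N) + O(1 + \nu(N)). \]

On the other hand

\[ r(2N) := \#\{(p, q) : p, q \in \mathbb{P},\ p + q = 2N\} \le S(\mathcal{A}, \mathcal{P}; z) + z \]

and we deduce from Corollary 3.2 that

\[ S(\mathcal{A}, \mathcal{P}; z) \le C_2 \Big(\prod_{p \mid 2N,\ p \neq 2} \frac{p - 1}{p - 2}\Big) \frac{\mathrm{Li}(2N)}{\log z} \Big\{1 + O\Big(\frac{1}{\log z}\Big)\Big\} + \sum_{d < z^2,\ (d, 2N) = 1} 3^{\nu(d)} \mu(d)^2 |R_d|. \]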

In order to estimate the remainder sum

\[ \Sigma := \sum_{d < z^2} 3^{\nu(d)} \mu(d)^2 E_1(2N, d), \]

where

\[ E_1(x, k) := \max_{y \le x,\ (l, k) = 1} |E(y; k, l)|, \]

we shall use the following well-known result.

The Bombieri-Vinogradov Theorem. For every $c_1 > 0$, there is a positive constant $c_2$ such that

\[ \sum_{k \le x^{1/2} (\log x)^{-c_2}} E_1(x, k) \ll \frac{x}{(\log x)^{c_1}}. \]

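Using Cauchy's inequality with Lemma 3.1 we find that

\[ \Sigma^2 \le \Big\{\sum_{d < z^2} \frac{(3^{\nu(d)} \mu(d)^2)^2}{d}\Big\} \Big\{\sum_{d < z^2} d\, E_1(2N, d)^2\Big\} \ll (\log z)^9 \sum_{d < z^2} d\, E_1(2N, d)^2. \]

It is trivial that

\[ \pi(x; k, l) \ll \frac{x}{k} \]

for $k \le x$, so that

\[ E(x; k, l) \ll \frac{x}{k} + \frac{x}{\varphi(k)} \ll \frac{x}{\varphi(k)} \ll \frac{x \log x}{k}. \]

Thus

\[ \sum_{d < z^2} d\, E_1(2N, d)^2 \ll N (\log N) \sum_{d < z^2} E_1(2N, d) \ll N^2 (\log N)^{-15}, \]

by the Bombieri-Vinogradov Theorem with $c_1 = 16$, on taking

\[ z = N^{1/4} (\log N)^{-c_2/2}. \]

We therefore conclude that

\[ \Sigma^2 \ll \frac{N^2}{(\log N)^6}. \]

Thus

\[ \sum_{d < z^2} 3^{\nu(d)} \mu(d)^2 E_1(2N, d) = O\big(N (\log N)^{-3}\big). \]

Moreover, it follows from Lemma 3.1 that

\[ (1 + \nu(N)) \sum_{d < z^2} 3^{\nu(d)} \mu(d)^2 \ll (1 + \nu(N)) \sqrt{N} (\log N)^3 \ll N (\log N)^{-3}. \]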

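We may now deduce the following result from (3.1).

Theorem 3.1 We have

\[ r(2N) \le \Big\{4 + O\Big(\frac{\log\log N}{\log N}\Big)\Big\} \sigma(N), \]

with

\[ \sigma(N) = C_2 \Big(\prod_{p \mid 2N,\ p \neq 2} \frac{p - 1}{p - 2}\Big) \frac{2N}{(\log 2N)^2}. \]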

It is conjectured that one may improve the Bombieri-Vinogradov Theorem to say that for any $\varepsilon > 0$ and any $c_1 > 0$ one has

\[ \sum_{k \le x^{1 - \varepsilon}} E_1(x, k) \ll \frac{x}{(\log x)^{c_1}}. \]

One would then obtain a bound

\[ r(2N) \le (2 + o(1)) \sigma(N) \]

in a completely analogous fashion. However the best unconditional result is due to Chen, in which the constant 4 is reduced to 3.9171. For comparison we note that it is conjectured that $r(2N) \sim \sigma(N)$.

The following theorem can be proved in the same way as Theorem 3.1 (Exercise!).

Theorem 3.2 For any positive integer $k$ we have

\[ \#\{p \le x : p, p + 2k \in \mathbb{P}\} \le 4 \Big(\prod_{p \mid 2k,\ p \neq 2} \frac{p - 1}{p - 2}\Big) C_2 \frac{x}{(\log x)^2} \Big(1 + O\Big(\frac{\log\log x}{\log x}\Big)\Big). \]

Corollary 3.3 (Viggo Brun) We have

\[ \sum_{p,\ p + 2 \in \mathbb{P}} \frac{1}{p} < \infty. \]

Bombieri, Friedlander and Iwaniec have proved a variant of Theorem 3.2 with the constant 4 replaced by 7/2. Their method does not establish a result uniform in $k$ and is therefore not applicable to Theorem 3.1. More complicated methods allow one to reduce the constant in Theorem 3.2 slightly further.

We proceed to discuss briefly some other applications of Selberg's sieve.
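Definition. Suppose that

\[ \sum_{w \le p < z} \frac{\omega(p) \log p}{p} = \kappa \log\frac{z}{w} + O(1) \quad \text{for } 2 \le w \le z. \tag{3.7} \]

Then the constant $\kappa$ is called the dimension of the sieve problem.

One can get by with a slightly weaker assumption in fact. The above definition corresponds to a version of the condition $\Omega_2(\kappa, L)$ in the book by Halberstam and Richert (page 142).

Remark. Since

\[ \sum_{w \le p < z} \frac{\log p}{p} = \log\frac{z}{w} + O(1), \]

the dimension of the sieve problem coincides with the "average value" of $\omega(p)$. Note that in the two cases considered in this section we have $\kappa = 1$. This is clear for both

\[ \omega(p) = \begin{cases} 0, & p \mid k, \\ 1, & p \nmid k, \end{cases} \]

and

\[ \omega(p) = \begin{cases} 0, & p \mid k, \\ \dfrac{p}{p - 1}, & p \nmid k. \end{cases} \]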
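In general, for the sieve problem of dimension $\kappa$, one obtains

\[ G(z, z) = \sum_{\substack{d \mid \Pi(\mathcal{P}, z) \\ d < z}} \prod_{p \mid d} g(p) = \sum_{d < z} \mu(d)^2 \prod_{p \mid d} \frac{\omega(p)}{p} \Big(1 - \frac{\omega(p)}{p}\Big)^{-1}. \]

Though we shall not prove it, it turns out that

\[ \frac{1}{G(z, z)} = \prod_{p < z} \Big(1 - \frac{\omega(p)}{p}\Big) e^{\gamma\kappa}\, \Gamma(\kappa + 1) \Big(1 + O\Big(\frac{1}{\log z}\Big)\Big); \]

see Halberstam and Richert, (5.3.1). It therefore follows from the Fundamental Theorem that

\[ S(\mathcal{A}, \mathcal{P}; z) \le X \prod_{p < z} \Big(1 - \frac{\omega(p)}{p}\Big) e^{\gamma\kappa}\, \Gamma(\kappa + 1) \Big(1 + O\Big(\frac{1}{\log z}\Big)\Big) + \sum_{d < z^2,\ d \mid \Pi(\mathcal{P}, z)} 3^{\nu(d)} |R_d|. \tag{3.8} \]

Note that if $\omega(p) = 1$, then

\[ \prod_{p < z} \Big(1 - \frac{1}{p}\Big) \sim \frac{e^{-\gamma}}{\log z}. \]

One therefore obtains

\[ S(\mathcal{A}, \mathbb{P}; z) \le \frac{X}{\log z} \Big(1 + O\Big(\frac{1}{\log z}\Big)\Big) + \sum_{d < z^2} 3^{\nu(d)} |R_d|, \]

cf. (3.1).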

Example. Let

\[ \mathcal{A} = \{n(2N - n) : 3 \le n \le 2N - 3\}, \]

so that

\[ \#\mathcal{A}_d = \#\{n : 1 \le n \le d,\ d \mid n(2N - n)\} \Big(\frac{2N}{d} + O(1)\Big) = \frac{\omega(d)}{d} 2N + O(\omega(d)), \]

where

\[ \omega(p) = \begin{cases} 1, & p \mid 2N, \\ 2, & p \nmid 2N. \end{cases} \]

In this case $\kappa = 2$ and the estimate (3.8) implies $r(2N) \le (8 + o(1)) \sigma(N)$ (Exercise!). This should be compared with Theorem 3.1, which was deduced using a sieve of dimension 1, together with the Bombieri-Vinogradov Theorem.



4 The parity phenomenon and limitations to sieve methods

The optimisation problem for the upper bound sieve amounts to the question 
of minimising the linear functional 

d\nip,z) d\n{p,z) 
under the additional condition that

\[ \sum_{d \mid n,\ d \mid \Pi(\mathcal{P}, z)} \mu^*(d) \ge \begin{cases} 1, & (n, \Pi(\mathcal{P}, z)) = 1, \\ 0, & \text{otherwise,} \end{cases} \]

(cf. (1.5)-(1.7)). We can view this as a linear programming problem. In the standard formulation of a linear programming problem one takes a real $m \times n$ matrix $A$ and real column vectors $b$ and $c$ of lengths $m$ and $n$ respectively. One then seeks to minimize $c^T x$ over all column vectors $x \in \mathbb{R}^n$, subject to the conditions that $x \ge 0$ and $Ax \ge b$. (Here $z \ge w$ means that $z_i \ge w_i$ for each index $i$.) In our problem the vector of values of $\mu^*(d)$ is not required to be non-negative, so we introduce two new functions $\mu^*_{\pm}(d)$ with $\mu^*_{\pm}(d) \ge 0$ for all $d$, and write $\mu^*(d) = \mu^*_+(d) - \mu^*_-(d)$. We can then produce a linear programming problem in standard form.

A great deal can be learnt about a linear programming problem by studying its "dual". For the problem described above, the dual problem is to maximize $y^T b$ over column vectors $y \in \mathbb{R}^m$, subject to $y \ge 0$ and $y^T A \le c^T$. Under these constraints one clearly has

\[ c^T x \ge y^T A x \ge y^T b \tag{4.1} \]

and the Duality Theorem states that there exist vectors $x$, $y$ for which equality is attained.
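The inequalities (4.1) and the Duality Theorem can be seen in a toy example. The sketch below is purely illustrative and unrelated to any specific sieve: the matrix $A$ and vectors $b$, $c$ are arbitrary choices, and the feasible points are sampled on a coarse grid that happens to contain both optima.

```python
from fractions import Fraction

A = [[1, 1], [1, 3]]
b = [2, 3]
c = [2, 3]


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def primal_feasible(x):
    """x >= 0 and A x >= b."""
    return all(xi >= 0 for xi in x) and all(
        dot(row, x) >= bi for row, bi in zip(A, b))


def dual_feasible(y):
    """y >= 0 and y^T A <= c^T."""
    columns = list(zip(*A))
    return all(yi >= 0 for yi in y) and all(
        dot(col, y) <= cj for col, cj in zip(columns, c))


grid = [Fraction(k, 2) for k in range(9)]          # 0, 1/2, ..., 4
primal_points = [(u, v) for u in grid for v in grid if primal_feasible((u, v))]
dual_points = [(u, v) for u in grid for v in grid if dual_feasible((u, v))]

best_primal = min(dot(c, x) for x in primal_points)
best_dual = max(dot(y, b) for y in dual_points)
print(best_primal, best_dual)
```

Every feasible $y$ gives a lower bound for every feasible $x$; when the two values meet, as here at $x = (3/2, 1/2)$ and $y = (3/2, 1/2)$, both are extremal. This is the mechanism used below to certify that a sieve is optimal.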
In the context of the upper bound sieve problem, it transpires that the dual problem is essentially that of finding a sequence $\mathcal{A}$, with prescribed function $\omega(d)$, and with suitably small remainders $R_d$, for which $S(\mathcal{A}, \mathcal{P}; z)$ is as large as possible. We can interpret the inequalities (4.1) as saying that, for any vector $y = y_0$ which satisfies the relevant constraints, we must have

\[ \inf c^T x \ge y_0^T b, \]

and moreover, if we have vectors $x$ and $y$ which both satisfy the relevant constraints, and for which $c^T x = y^T b$, then both $x$ and $y$ must be extremal.

For the upper bound sieve problem, any sequence $\mathcal{A}$ defining a problem of dimension 1 will therefore produce a lower bound on the possible values of $I(\mu^*)$. Moreover if we can find a sequence $\mathcal{A}$ and a set of coefficients $\mu^*(d)$ for which $S(\mathcal{A}, \mathcal{P}; z)$ (corresponding to $y^T b$) and $I(\mu^*)$ (corresponding to $c^T x$) are approximately equal, then both must be essentially optimal.

We therefore examine in detail the following two sequences, first discussed by Selberg. Let $\Omega(n)$ be the number of prime factors of $n$, counted according to multiplicity, and define the Liouville function $\lambda(n)$ by

\[ \lambda(n) = (-1)^{\Omega(n)}. \]

We then set

\[ \mathcal{A}^+ = \{n \in \mathbb{N} : n \le x,\ \lambda(n) = -1\}, \]

and

\[ \mathcal{A}^- = \{n \in \mathbb{N} : n \le x,\ \lambda(n) = +1\}, \]

which will relate to the upper bound and lower bound problems respectively. In the case of the first sequence we have

\[ S(\mathcal{A}^+, \mathbb{P}; z) = \pi(x) - \pi(z) = \frac{x}{\log x} + O\Big(\frac{x}{(\log x)^2}\Big) + O(z) \quad \text{for } z \ge x^{1/2}. \]

Let us see how this compares with the bound given by the Selberg sieve. To bound $R_d$ for the sequences $\mathcal{A}^{\pm}$ we note that if

\[ L(x) := \sum_{n \le x} \lambda(n), \]

then $L(x) \ll E_c(x)$, where $c$ is a suitable positive constant and

\[ E_c(x) := x \exp(-c \sqrt{\log x}). \]

(This follows by a similar analysis to that used for the summatory function of $\mu(n)$.) Now, if we let $X = x/2$ then

\[ \#\mathcal{A}^{\pm}_d = \#\{m \le x/d : \lambda(md) = \mp 1\} = \#\{m \le x/d : \lambda(m) = \mp \lambda(d)\} = \frac{[x/d]}{2} \mp \frac{\lambda(d)}{2} L(x/d) \]
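for $d \le x$, and hence the remainder sum in the Fundamental Theorem is

\[ \sum_{d < y,\ d \mid \Pi(\mathcal{P}, z)} 3^{\nu(d)} |R_d| \ll \sum_{d < y} 3^{\nu(d)} \mu(d)^2 \big(1 + E_c(x/d)\big) \ll y (\log y)^3 + \sum_{d < y} 3^{\nu(d)} \mu(d)^2 E_c(x/d) \]

in view of Lemma 3.1. Furthermore,

\[ \sum_{d < y} d\, E_c(x/d)^2 \le \sum_{d < y} \frac{x^2}{d} \exp\big(-2c \sqrt{\log x/y}\big) \ll x^2 (\log y) \exp\big(-2c \sqrt{\log x/y}\big) \]

for $y \le x$. Thus, by Cauchy's inequality and Lemma 3.1,

\[ \sum_{d < y,\ d \mid \Pi(\mathcal{P}, z)} 3^{\nu(d)} |R_d| \ll (\log x)^5\, x\, e^{-c \sqrt{\log x/y}} \ll x (\log x)^{-2} \tag{4.2} \]

for $y \le E_1(x)$, say. By Corollary 3.1, we have

\[ S(\mathcal{A}^+, \mathbb{P}; z) \le \frac{X}{\log z} + \sum_{d < z^2} 3^{\nu(d)} |R_d|. \]

On taking $y = E_1(x)$ and $z = y^{1/2}$ one deduces from these estimates, on recalling that $x = 2X$, that

\[ S(\mathcal{A}^+, \mathbb{P}; z) \le \frac{x}{\log y} + O\Big(\frac{x}{(\log x)^2}\Big) = \frac{x}{\log x} + O\Big(\frac{x}{(\log x)^{3/2}}\Big). \]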
Since

\[ S(\mathcal{A}^+, \mathbb{P}; z) = \pi(x) - \pi(z) = \frac{x}{\log x} + O\Big(\frac{x}{(\log x)^2}\Big) + O(z) \quad \text{for } z \ge x^{1/2}, \]

we conclude that the estimate (3.1) cannot be improved on for $z \ge x^{1/2}$.

Thus the Selberg sieve is best possible in this situation. Our remarks about linear programming then show that the Selberg sieve coefficients are an essentially optimal solution to the minimisation problem for $I(\mu^*)$, and that the sequence $\mathcal{A}^+$ is a corresponding solution for the dual problem.

Turning to the lower bound sieve problem, we see that we can satisfy the relevant constraints

\[ \sum_{d \mid n,\ d \mid \Pi(\mathcal{P}, z)} \mu^*(d) \le \begin{cases} 1, & (n, \Pi(\mathcal{P}, z)) = 1, \\ 0, & \text{otherwise,} \end{cases} \]

by taking $\mu^*(d)$ to be identically zero. For this choice we produce the trivial lower bound

\[ S(\mathcal{A}, \mathcal{P}; z) \ge X \sum_{d \mid \Pi(\mathcal{P}, z)} \frac{\mu^*(d)\omega(d)}{d} - \sum_{d \mid \Pi(\mathcal{P}, z)} |\mu^*(d)|\,|R_d| = 0. \]

We now observe that for the sequence $\mathcal{A}^-$ we have $S(\mathcal{A}^-, \mathbb{P}; z) = 1$ for $z > x^{1/2}$, since only the integer $1 \in \mathcal{A}^-$ is counted. Thus the coefficients $\mu^*(d) = 0$ are essentially best possible for the linear programming problem in this situation, and the sequence $\mathcal{A}^-$ is essentially optimal for the corresponding dual problem.

Thus no set of lower bound sieve coefficients with < 3'^^'^) 

can produce 



E 



II* (d) 1 



> 



: , s d lOgV 

when $z \ge x^{1/2}$. In particular one cannot show that $\pi(x) \gg x/\log x$ by sieve methods alone.

The two sequences $\mathcal{A}^+$ and $\mathcal{A}^-$ produce the same information input for the sieve. They have the same $X$, the same function $\omega(d)$, and their remainders $R_d$ have the same order of magnitude. Thus there is no way that the sieve machinery can distinguish them. It is for this reason that the sieve encounters the parity phenomenon, since it is unable to distinguish integers for which $\Omega(n)$ is even, from those for which $\Omega(n)$ is odd.
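The parity phenomenon is visible in a direct computation. The sketch below is an illustration with hypothetical helper names and arbitrary parameters ($x = 10^4$, $z = 101$): it shows that $\#\mathcal{A}^+_d$ and $\#\mathcal{A}^-_d$ are both close to $x/(2d)$, while the sifted counts are completely different.

```python
def smallest_prime_factors(x):
    """spf[n] = smallest prime factor of n for 2 <= n <= x."""
    spf = list(range(x + 1))
    for p in range(2, int(x ** 0.5) + 1):
        if spf[p] == p:
            for m in range(p * p, x + 1, p):
                if spf[m] == m:
                    spf[m] = p
    return spf


def liouville_table(spf):
    """lam[n] = (-1)^Omega(n), Omega counting prime factors with multiplicity."""
    lam = [0, 1] + [0] * (len(spf) - 2)
    for n in range(2, len(spf)):
        lam[n] = -lam[n // spf[n]]
    return lam


x, z = 10 ** 4, 101                      # z > sqrt(x)
spf = smallest_prime_factors(x)
lam = liouville_table(spf)
A_plus = [n for n in range(1, x + 1) if lam[n] == -1]
A_minus = [n for n in range(1, x + 1) if lam[n] == +1]


def count_multiples(A, d):
    """#A_d = number of elements of A divisible by d."""
    return sum(1 for n in A if n % d == 0)


def sifted(A, z):
    """Number of n in A with no prime factor below z."""
    return sum(1 for n in A if n == 1 or spf[n] >= z)


for d in (1, 2, 3, 5, 6, 7, 10):
    print(d, count_multiples(A_plus, d), count_multiples(A_minus, d), x / (2 * d))
print(sifted(A_plus, z), sifted(A_minus, z))
```

The sieve sees only the counts $\#\mathcal{A}_d$, which nearly coincide for the two sequences, so it cannot separate the large value of $S(\mathcal{A}^+, \mathbb{P}; z)$ from $S(\mathcal{A}^-, \mathbb{P}; z) = 1$.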

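The sequences $\mathcal{A}^{\pm}$ have been shown to be essentially extremal for $z \ge x^{1/2}$, but it transpires that they are optimal for all $z$. To examine this fact we define

\[ S^{\pm}(x, s) = \#\{n \in \mathbb{N} : n \le x,\ \lambda(n) = \mp 1,\ p \mid n \Rightarrow p \ge x^{1/s}\} \]

for $s \ge 1$. We classify the integers $n$ according to their smallest prime factor $p$. Then if

\[ \eta_+ = 0, \qquad \eta_- = 1 \]

it follows that

\[ S^{\pm}(x, s) = \sum_{x^{1/s} \le p \le x} \#\{m : pm \le x,\ \lambda(m) = \pm 1,\ p' \mid m \Rightarrow p' \ge p\} + \eta_{\pm} \]
\[ = \sum_{x^{1/s} \le p \le x} S^{\mp}\Big(\frac{x}{p}, \frac{\log x}{\log p} - 1\Big) + \eta_{\pm}, \]

since

\[ p = \Big(\frac{x}{p}\Big)^{1/s'} \]

with

\[ s' = \frac{\log x/p}{\log p}. \]

This leads to recursion formulae for $S^{\pm}(x, s)$. To produce appropriate starting values for the recursions we note that

\[ S^-(x, s) = O(1) \quad (1 \le s \le 2) \]

and

\[ S^+(x, s) = \pi(x) - \pi(x^{1/s}) + O(1) \quad (1 \le s \le 3). \]

Let us define continuous functions $F, f : \mathbb{R}_{>0} \to \mathbb{R}$ by the relations

\[ F(s) = \frac{2e^{\gamma}}{s} \quad \text{for } 0 < s \le 3, \qquad f(s) = 0 \quad \text{for } 0 < s \le 2, \]

and

\[ (sF(s))' = f(s - 1), \qquad (sf(s))' = F(s - 1) \quad \text{for } s > 2. \]

We note that these definitions show that

\[ f(s) = \frac{2e^{\gamma} \log(s - 1)}{s} \quad \text{for } 2 \le s \le 4. \]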
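The closed form for $f(s)$ on $2 \le s \le 4$ follows by integrating $(sf(s))' = F(s - 1) = 2e^{\gamma}/(s - 1)$ from $s = 2$, where $2f(2) = 0$. The sketch below checks this with a midpoint rule; it is a numerical illustration only, with hypothetical function names and a hard-coded value of Euler's constant.

```python
import math

EULER_GAMMA = 0.5772156649015329


def F(s):
    """F(s) = 2 e^gamma / s, the definition valid for 0 < s <= 3."""
    return 2.0 * math.exp(EULER_GAMMA) / s


def f_numeric(s, steps=20000):
    """f(s) for 2 <= s <= 4 from s f(s) = integral of F(u - 1) over [2, s]."""
    h = (s - 2.0) / steps
    integral = sum(F(2.0 + (k + 0.5) * h - 1.0) for k in range(steps)) * h
    return integral / s


def f_closed(s):
    """The closed form 2 e^gamma log(s - 1) / s."""
    return 2.0 * math.exp(EULER_GAMMA) * math.log(s - 1.0) / s


for s in (2.5, 3.0, 4.0):
    print(s, f_numeric(s), f_closed(s))
```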

We shall now prove the following estimates.

Theorem 4.1 Let $N \in \mathbb{N}$. Then we have

\[ S^+(x, s) = \frac{X}{e^{\gamma} \log(x^{1/s})} F(s) + O_N\Big(\frac{x}{(s - 1)(\log x)^2}\Big) \tag{4.3} \]

and

\[ S^-(x, s) = \frac{X}{e^{\gamma} \log(x^{1/s})} f(s) + O_N\Big(\frac{x}{(\log x)^2}\Big) \tag{4.4} \]

for $1 < s \le N$.

Proof. Since

\[ \frac{X}{e^{\gamma} \log(x^{1/s})} F(s) = \frac{x}{\log x} \quad (0 < s \le 3) \]

and

\[ S^+(x, s) = \pi(x) - \pi(x^{1/s}) + O(1) \quad (0 < s \le 3), \]

relation (4.3) holds for $N = 2$ and $N = 3$. Similarly, when $N = 2$ the equation (4.4) follows from the facts that $f(s) = 0$ and $S^-(x, s) = O(1)$ whenever $1 \le s \le 2$. We now prove (4.3) and (4.4) by induction on $N$. We shall consider only $S^+(x, s)$, leaving the discussion of $S^-(x, s)$ as an exercise.
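Thus we assume that (4.3) and (4.4) hold for $N$, and deduce that (4.3) holds for $N + 1$. We therefore let $N \le s \le N + 1$, with $N \ge 3$. Since

\[ 0 < \frac{\log x}{\log p} - 1 < 2 \quad \text{for } p > x^{1/3}, \]

it follows that

\[ S^+(x, s) = \sum_{x^{1/s} \le p < x^{1/3}} S^-\Big(\frac{x}{p}, \frac{\log x}{\log p} - 1\Big) + \pi(x) - \pi(x^{1/3}) + O(1). \]

Moreover, if $p \ge x^{1/s}$ then $\frac{\log x}{\log p} - 1 \le s - 1 \le N$. Therefore, by the inductive assumption, we have

\[ S^-\Big(\frac{x}{p}, \frac{\log x}{\log p} - 1\Big) = \frac{X}{p\, e^{\gamma} \log p}\, f\Big(\frac{\log x}{\log p} - 1\Big) + O_N\Big(\frac{x}{p (\log x)^2}\Big) \tag{4.5} \]

for $x^{1/s} \le p < x^{1/3}$. By partial summation we find that

\[ \sum_{x^{1/s} \le p < x^{1/3}} \frac{X}{p\, e^{\gamma} \log p}\, f\Big(\frac{\log x}{\log p} - 1\Big) = \int_{x^{1/s}}^{x^{1/3}} \frac{X}{e^{\gamma}\, t \log t}\, f\Big(\frac{\log x}{\log t} - 1\Big) \frac{dt}{\log t} + O_N\Big(\frac{x}{(\log x)^2}\Big) \]
\[ = \frac{X}{e^{\gamma} \log x} \big(sF(s) - 3F(3)\big) + O_N\Big(\frac{x}{(\log x)^2}\Big), \]

on substituting $t = x^{1/u}$. To handle the error term in (4.5) we note that

\[ \sum_{x^{1/s} \le p < x^{1/3}} \frac{x}{p (\log x)^2} \ll_N \frac{x}{(\log x)^2}, \]

whence we conclude that

\[ S^+(x, s) = \frac{x}{2e^{\gamma} \log x} \big(sF(s) - 3F(3)\big) + \frac{x}{\log x} + O_N\Big(\frac{x}{(\log x)^2}\Big) \]
\[ = \frac{x}{2e^{\gamma} \log x^{1/s}} F(s) + O_N\Big(\frac{x}{(\log x)^2}\Big), \]

as required.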
as required. 
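Remarks.

1) The properties of the functions $f$, $F$ and their generalisations are discussed in detail in the books by Greaves and Halberstam and Richert (Chapter 8).

2) If we set

\[ W(z) := \prod_{p < z} \Big(1 - \frac{1}{p}\Big), \]

then the Mertens formula (1.4) gives

\[ W(x^{1/s}) = \frac{1}{e^{\gamma} \log(x^{1/s})} + O_N\Big(\frac{1}{(\log x)^2}\Big), \]

so that (4.3) and (4.4) imply that

\[ S^+(x, s) = X\, W(x^{1/s}) F(s) + O_N\Big(\frac{x}{(s - 1)(\log x)^2}\Big) \]

and

\[ S^-(x, s) = X\, W(x^{1/s}) f(s) + O_N\Big(\frac{x}{(\log x)^2}\Big) \]

respectively.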
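5 The Rosser sieve

A combinatorial sieve is defined by choosing sets

\[ T^+(y),\ T^-(y) \subseteq \{d \in \mathbb{N} : d < y,\ \mu(d)^2 = 1\} \]

and taking

\[ \mu^{\pm}(d) = \begin{cases} \mu(d), & d \in T^{\pm}(y), \\ 0, & \text{otherwise.} \end{cases} \tag{5.1} \]

The sets $T^{\pm}(y)$ have to be chosen so that $\mu^-(d)$ and $\mu^+(d)$ satisfy (1.5) and (1.6) respectively. As in the proof of (1.7) we have

\[ S(\mathcal{A}, \mathcal{P}; z) \le X \sum_{d \mid \Pi(\mathcal{P}, z)} \frac{\mu^+(d)\omega(d)}{d} + \sum_{d \mid \Pi(\mathcal{P}, z)} \mu^+(d) R_d. \tag{5.2} \]

It follows from (5.1) and (5.2) that

\[ \Big|\sum_{d \mid \Pi(\mathcal{P}, z)} \mu^{\pm}(d) R_d\Big| \le \sum_{d < y,\ d \mid \Pi(\mathcal{P}, z)} |R_d|. \]

We have to choose $T^{\pm}(y)$ so as to optimise the main term in (5.2) subject to the condition (1.6). It follows from (1.8) and its analogue for $\mu^-$ that

\[ S(\mathcal{A}, \mathcal{P}; z) \ge X \sum_{\substack{d \mid \Pi(\mathcal{P}, z) \\ d \in T^-(y)}} \frac{\mu(d)\omega(d)}{d} - \sum_{d < y,\ d \mid \Pi(\mathcal{P}, z)} |R_d| \]

and

\[ S(\mathcal{A}, \mathcal{P}; z) \le X \sum_{\substack{d \mid \Pi(\mathcal{P}, z) \\ d \in T^+(y)}} \frac{\mu(d)\omega(d)}{d} + \sum_{d < y,\ d \mid \Pi(\mathcal{P}, z)} |R_d|. \]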

One way to arrange for (1.5) and (1.6) to hold is as follows. Let

\[ T := \{d : d \in \mathbb{N},\ \mu(d)^2 = 1\}, \]

write $d = p_1 p_2 p_3 \cdots$ with $p_1 > p_2 > p_3 > \cdots$ for $d \in T$, and let

\[ T_r = \{d : d \in T,\ (\nu(d) < r) \ \text{or}\ (\nu(d) \ge r \ \&\ \Psi(p_1, \ldots, p_r))\} \]

for some predicate $\Psi$ to be defined later. Let

\[ T^+(y) = \bigcap_{u = 1}^{\infty} T_{2u - 1}, \qquad T^-(y) = \bigcap_{u = 1}^{\infty} T_{2u}. \]

With these definitions we have the following lemma.
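Lemma 5.1 For $m \in \mathbb{N}$ and $T^{\pm}(y)$ defined as above, we have

\[ \sum_{d \mid m,\ d \in T^-(y)} \mu(d) \le \sum_{d \mid m} \mu(d) \le \sum_{d \mid m,\ d \in T^+(y)} \mu(d). \]

Proof. Let

\[ B_r := \{d \in T : d \notin T_r,\ d \in T_{r - 2} \cap T_{r - 4} \cap \cdots\}, \]

then

\[ T \setminus T^+(y) = \{d \in T : \exists v\ (d \notin T_{2v - 1})\} = \bigcup_{u = 1}^{\infty} B_{2u - 1}. \]

Moreover we have $B_j \cap B_k = \emptyset$ if $j \neq k$. Therefore

\[ \sum_{d \mid m} \mu(d) = \sum_{d \mid m,\ d \in T^+(y)} \mu(d) + \Delta, \]

where

\[ \Delta = \sum_{u = 1}^{\infty}\ \sum_{d \mid m,\ d \in B_{2u - 1}} \mu(d). \]

We set

\[ C_r = \{d \in B_r : \nu(d) = r\}, \]

and if $d > 1$ we write $p(d)$ for the smallest prime factor of $d$. We then define

\[ Q(d) = \prod_{p \in \mathbb{P},\ p < p(d)} p. \]

Now, if $d \in B_r$, then $d \notin T_r$. Thus $\nu(d) \ge r$ and $\Psi(p_1, \ldots, p_r)$ does not hold. Write $d = ef$ with $e = p_1 \cdots p_r$ and $f \mid Q(e)$. Since $d \in B_r$, it follows that each property $\Psi(p_1, \ldots, p_{r - 2})$, $\Psi(p_1, \ldots, p_{r - 4})$, $\ldots$ holds, so that $e \in C_r$. Hence if $d \in B_r$ then we can write $d = ef$ with $e \in C_r$ and $f \mid Q(e)$. Clearly, the decomposition $d = ef$ with $e \in C_r$ and $f \mid Q(e)$ is unique. Conversely, if $d = ef$ with $e \in C_r$ and $f \mid Q(e)$, then $d \in B_r$. Therefore

\[ \sum_{d \mid m,\ d \in B_{2u - 1}} \mu(d) = \sum_{\substack{ef \mid m,\ e \in C_{2u - 1} \\ f \mid Q(e)}} \mu(ef) = \sum_{\substack{e \mid m \\ e \in C_{2u - 1}}} \mu(e) \sum_{f \mid (m, Q(e))} \mu(f) \]
\[ = \sum_{\substack{e \mid m,\ (m, Q(e)) = 1 \\ e \in C_{2u - 1}}} \mu(e) = - \sum_{\substack{e \mid m,\ (m, Q(e)) = 1 \\ e \in C_{2u - 1}}} 1 \le 0, \]

since $e \in C_{2u - 1}$ implies that $\nu(e) = 2u - 1$ and hence that $\mu(e) = -1$. Thus

\[ \Delta = \sum_{u = 1}^{\infty} \Big(\sum_{d \mid m,\ d \in B_{2u - 1}} \mu(d)\Big) \le 0, \]

so that

\[ \sum_{d \mid m,\ d \in T^+(y)} \mu(d) = \sum_{d \mid m} \mu(d) - \Delta \ge \sum_{d \mid m} \mu(d) \]

as claimed. The inequality

\[ \sum_{d \mid m,\ d \in T^-(y)} \mu(d) \le \sum_{d \mid m} \mu(d) \]

can be proved in the same way.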
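Lemma 5.1 holds for any predicate, so it can be tested by brute force. The sketch below is an illustration with hypothetical helper names; as its predicate it uses the condition $p_1 \cdots p_{r - 1} p_r^3 < y$, which is one admissible choice. It compares the truncated Möbius sums over $T^-(y)$ and $T^+(y)$ with the full sum $\sum_{d \mid m} \mu(d)$.

```python
def prime_factors_desc(d):
    """Prime factors of a square-free d in decreasing order; None otherwise."""
    ps, p = [], 2
    while p * p <= d:
        if d % p == 0:
            d //= p
            if d % p == 0:
                return None               # p^2 divided the original d
            ps.append(p)
        p += 1
    if d > 1:
        ps.append(d)
    return sorted(ps, reverse=True)


def in_T(ps, y, parity):
    """Membership in T^+(y) (parity 1) or T^-(y) (parity 0).

    The predicate p_1 ... p_{r-1} p_r^3 < y must hold for every r <= nu(d)
    with r = parity (mod 2).
    """
    partial = 1
    for r, p in enumerate(ps, start=1):
        if r % 2 == parity and partial * p ** 3 >= y:
            return False
        partial *= p
    return True


def mobius_sum(m, y=None, parity=None):
    """Sum of mu(d) over d | m, optionally restricted to T^+(y) or T^-(y)."""
    total = 0
    for d in range(1, m + 1):
        if m % d:
            continue
        ps = prime_factors_desc(d)
        if ps is None:
            continue
        if parity is None or in_T(ps, y, parity):
            total += (-1) ** len(ps)
    return total


for m in (1, 6, 30, 210):
    print(m, mobius_sum(m, 50, 0), mobius_sum(m), mobius_sum(m, 50, 1))
```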

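This completes the proof of Lemma 5.1. However a useful alternative way of viewing the combinatoric facts used in the argument is as follows. We have

\[ S(\mathcal{A}, \mathcal{P}; z) = \sum_{n \in \mathcal{A}}\ \sum_{\substack{d \mid \Pi(\mathcal{P}, z) \\ d \mid n}} \mu(d) \]
\[ = \sum_{n \in \mathcal{A}}\ \sum_{\substack{d \mid \Pi(\mathcal{P}, z) \\ d \mid n}} \mu^+(d) + \sum_{u = 1}^{\infty}\ \sum_{n \in \mathcal{A}}\ \sum_{\substack{d \mid n,\ d \mid \Pi(\mathcal{P}, z) \\ d \in B_{2u - 1}}} \mu(d) \]
\[ = \sum_{d \mid \Pi(\mathcal{P}, z)} \mu^+(d)\, \#\mathcal{A}_d - \sum_{u = 1}^{\infty}\ \sum_{n \in \mathcal{A}}\ \sum_{\substack{e \mid (n, \Pi(\mathcal{P}, z)),\ e \in C_{2u - 1} \\ (n, \Pi(\mathcal{P}, z), Q(e)) = 1}} 1 \]
\[ = \sum_{d \mid \Pi(\mathcal{P}, z)} \mu^+(d)\, \#\mathcal{A}_d - \sum_{u = 1}^{\infty}\ \sum_{\substack{e \in C_{2u - 1} \\ e \mid \Pi(\mathcal{P}, z)}} S(\mathcal{A}_e, \mathcal{P}; p(e)). \tag{5.3} \]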

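Thus far, all we have said applies to any predicate $\Psi$ and any sieve problem. We now specialize to a sieve problem of dimension 1, and examine (5.3) in the particular case $\mathcal{A} = \mathcal{A}^+$, which we expect to be extremal. Here we have

\[ S(\mathcal{A}^+, \mathbb{P}; z) = \frac{X}{e^{\gamma} \log(x^{1/s})} F(s) + O_N\Big(\frac{x}{(s - 1)(\log x)^2}\Big) \]

for $1 < s \le N$, by Theorem 4.1. Moreover the estimate (4.2) shows that

\[ \sum_{d \mid \Pi(\mathbb{P}, z)} \mu^+(d)\, \#\mathcal{A}^+_d = X \sum_{d \mid \Pi(\mathbb{P}, z)} \frac{\mu^+(d)}{d} + O\big(x (\log x)^{-2}\big). \]

We therefore conclude that

\[ \sum_{d \mid \Pi(\mathbb{P}, z)} \frac{\mu^+(d)}{d} = \frac{F(s)}{e^{\gamma} \log z} + \frac{1}{X} \sum_{u = 1}^{\infty}\ \sum_{\substack{e \in C_{2u - 1} \\ e \mid \Pi(\mathbb{P}, z)}} S(\mathcal{A}^+_e, \mathbb{P}; p(e)) + O_N\big((s - 1)^{-1} (\log x)^{-2}\big), \]

where $y = E_1(x)$, $z = x^{1/s}$ and $1 < s \le N$. If we replace $s = (\log x)/(\log z)$ by $s' = (\log y)/(\log z)$ then the right hand side above is

\[ \frac{F(s')}{e^{\gamma} \log z} + \frac{1}{X} \sum_{u = 1}^{\infty}\ \sum_{\substack{e \in C_{2u - 1} \\ e \mid \Pi(\mathbb{P}, z)}} S(\mathcal{A}^+_e, \mathbb{P}; p(e)) + O_N\big((s' - 1)^{-1} (\log x)^{-3/2}\big) \tag{5.4} \]

for $1 < s' \le N$. We then re-define $s$ as $(\log y)/(\log z)$. This produces an upper bound for the sum

\[ \sum_{d \mid \Pi(\mathbb{P}, z)} \frac{\mu^+(d)}{d} \tag{5.5} \]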

which involves F{s) together with information about the property ^ incor- 
porated in the definition of the sets Cr- Since our goal is to minimize the sum 
(5.5), we aim to choose ^ so that S{A^,f';p{e)) is as close to as possible 
for e G C2u-i- However the relevant integers e all have i/(e) = 2u — l, so that 
z/(m) is even for any m e A^- Moreover, every such m satisfies m < x/e. 
Hence we would have 5'(^+, P;p(e)) = 1 providing that x/e < p(e)^. Look- 
ing back at the definition of the set C2U-1 we see that we would want to 
have 

P1P2 ■ ■ ■ P2u-3pIu-1 > X 

whenever ^(pi, . . . ,P2u-i) is false. Making a marginal adjustment to produce 
a condition which involves y rather than x we therefore take the property 
^(pi, . . . iPr) to say that 



PiP2---Pr-iPl < y, 



whence 



$$\mathcal{B}^{+}_r(y)=\bigcap_{t=1}^{(r+1)/2}\{d\in\mathbb{N} :\ \nu(d)=r,\ d=p_1\cdots p_r\ \Rightarrow\ p_1p_2\cdots p_{2t-1}^3<y\}.$$

Although we have not made $S(\mathcal{A}^+_e,\mathcal{P};p(e))$ completely vanish
for $e$ in $C_{2u-1}$ it can be shown that this construction does indeed make the
sum in (5.4) suitably small. We have therefore produced an admissible set of upper
bound sieve coefficients $\mu^{+}(d)$ which match up with the Selberg sequence
$\mathcal{A}^+$, and the linear programming argument then shows that both are optimal.

One can discuss the lower bound problem in exactly the same way, using
the sequence $\mathcal{A}^-$, and leading to the choice

$$\mathcal{B}^{-}_r(y)=\bigcap_{t=1}^{r/2}\{d\in\mathbb{N} :\ \nu(d)=r,\ d=p_1\cdots p_r\ \Rightarrow\ p_1p_2\cdots p_{2t}^3<y\}.$$

The construction of $\mu^{\pm}(d)$ we have been led to is known as the Rosser-Iwaniec
sieve for dimension 1, there being variants in other dimensions. (The reader
should note that, except for dimensions 1 and 1/2, the general Rosser-Iwaniec
sieve is not known to be optimal. Indeed in many cases it is known not to be
optimal.) One noteworthy feature of the construction is that the definition
of the weights $\mu^{\pm}(d)$ does not involve either the parameter $z$ or the
function $\omega(d)$.
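The construction can be tried out numerically. The following Python sketch is an illustration only, not part of the lectures: it assumes the simplest setting $\mathcal{A} = \{n \le x\}$ with $\mathcal{P}$ the set of all primes, so that $\#\mathcal{A}_d = [x/d]$, and it reads the truncation rule as "keep $d = p_1 \cdots p_r$ (with $p_1 > \cdots > p_r$) when $p_1 \cdots p_{l-1} p_l^3 < y$ for every $l \le r$ of the relevant parity". The function names are hypothetical.

```python
from math import isqrt

def primes_below(z):
    """Primes p < z by trial division (adequate for small z)."""
    return [p for p in range(2, z)
            if all(p % q for q in range(2, isqrt(p) + 1))]

def rosser_weights(primes, y, sign):
    """Non-zero values of mu^+(d) (sign=+1) or mu^-(d) (sign=-1).

    d = p_1 ... p_r with p_1 > ... > p_r is kept when
    p_1 ... p_{l-1} p_l^3 < y for every l <= r of the relevant parity
    (l odd for the upper bound weights, l even for the lower bound ones).
    `primes` must be sorted in increasing order.
    """
    weights = {1: 1}

    def extend(d, r, max_index):
        # Append a prime smaller than every prime already dividing d.
        for i in range(max_index):
            p = primes[i]
            level = r + 1
            constrained = (level % 2 == 1) if sign > 0 else (level % 2 == 0)
            if constrained and d * p ** 3 >= y:
                continue  # property fails: drop d*p and everything built on it
            weights[d * p] = (-1) ** level
            extend(d * p, level, i)

    extend(1, 0, len(primes))
    return weights

def sieve_bounds(x, z, y):
    """(lower, exact, upper) for A = {n <= x}, sifted by all primes below z."""
    primes = primes_below(z)
    exact = sum(1 for n in range(1, x + 1) if all(n % p for p in primes))
    upper = sum(w * (x // d) for d, w in rosser_weights(primes, y, +1).items())
    lower = sum(w * (x // d) for d, w in rosser_weights(primes, y, -1).items())
    return lower, exact, upper

if __name__ == "__main__":
    print(sieve_bounds(10_000, 30, 1_000))
```

The weight tables depend only on $y$ and on which primes are available, in line with the remark that neither $z$ nor $\omega(d)$ enters the definition.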

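Although our discussion has been concerned with the case $\omega(d) = 1$,
the Rosser-Iwaniec weights may be applied to the general sieve problem of
dimension 1. Thus if we set

$$M^{\pm}(z,y)=\sum_{d\mid\Pi(\mathcal{P},z)}\mu^{\pm}(d)\frac{\omega(d)}{d}$$

we will have

$$S(\mathcal{A},\mathcal{P};z)\le M^{+}(z,y)X+\sum_{\substack{d<y\\ d\mid\Pi(\mathcal{P},z)}}|R_d| \tag{5.6}$$

and

$$S(\mathcal{A},\mathcal{P};z)\ge M^{-}(z,y)X-\sum_{\substack{d<y\\ d\mid\Pi(\mathcal{P},z)}}|R_d|. \tag{5.7}$$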

Iwaniec has established the following bounds for $M^{+}(z,y)$ and $M^{-}(z,y)$.

Theorem 5.1 Suppose that

$$\sum_{w\le p<z}\frac{\omega(p)\log p}{p}\le\log\frac{z}{w}+O(1)$$

and write, as usual,

$$W(z)=\prod_{p<z}\Big(1-\frac{\omega(p)}{p}\Big).$$

Then

$$M^{+}(z,y)\le W(z)\{F(s)+O(e^{-s}(\log y)^{-1/3})\}$$

and

$$M^{-}(z,y)\ge W(z)\{f(s)+O(e^{-s}(\log y)^{-1/3})\}.$$

We shall not prove this theorem here. 

Remark. Note that Iwaniec requires only a one-sided condition, in con- 
trast to the two-sided condition (3.7) introduced in the context of the Selberg 
sieve. Thus Theorem 5.1 applies to sieves of dimension less than 1, and even 
to certain problems without a well-defined dimension. 

In view of (5.6) and (5.7), one obtains the following inequalities. 

Corollary 5.1 Under the condition of Theorem 5.1 we have

$$S(\mathcal{A},\mathcal{P};z)\le XW(z)\{F(s)+O(e^{-s}(\log y)^{-1/3})\}+\sum_{d<y}|R_d|$$

and

$$S(\mathcal{A},\mathcal{P};z)\ge XW(z)\{f(s)+O(e^{-s}(\log y)^{-1/3})\}-\sum_{d<y}|R_d|.$$

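Example 1. Let $\mathcal{A} = \{n \in \mathbb{N} : n \le x\}$, $X = x$, $\omega(d) = 1$.
Then $R_d = O(1)$, since $\#\mathcal{A}_d = [x/d] = x/d + O(1)$.

From Corollary 5.1 one obtains

$$S(\mathcal{A},\mathbb{P};z)\ge x\prod_{p<z}\Big(1-\frac1p\Big)(f(s)+o(1))-\sum_{d<y}|R_d|. \tag{5.8}$$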

Let y = (logx)'^ ^^'^ suppose that y^/^ < z < y^/^. Then 

log V 
logz 

whence

$$f(s)=\frac{2e^{\gamma}\log(s-1)}{s}.$$

Thus (5.8), in conjunction with Mertens Theorem (1.4), gives

s^Ax..) > .{r^^o(ji^)}(H:::Mi^+o((iog.)-v3)) 

^(loga;)"'^ 
{l + o(l)}log(.-l) + 0(-^). 



logy ^(logx) 
Hence

$$S(\mathcal{A},\mathbb{P};z)\gg\frac{x}{\log x}$$

for any constant value of $s$ strictly greater than 2. To detect primes, one
would need to consider the situation with $s = 2$. We just fail to find primes,
which is not surprising since the sequence $\mathcal{A}^-$ contains no primes (the
parity phenomenon!).
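For small $s$ the two sieve functions have elementary closed forms, which makes the failure at $s = 2$ easy to see numerically. The sketch below is an illustration that assumes the standard formulas for the linear sieve, $F(s) = 2e^{\gamma}/s$ for $0 < s \le 3$ and $f(s) = 2e^{\gamma}\log(s-1)/s$ for $2 \le s \le 4$ (with $f(s) = 0$ for $s \le 2$). The function names are hypothetical, and larger $s$ is deliberately not handled.

```python
from math import exp, log

EULER_GAMMA = 0.5772156649015329  # Euler's constant

def F_small(s):
    """Upper bound sieve function; closed form used only for 0 < s <= 3."""
    if not 0 < s <= 3:
        raise ValueError("closed form assumed only for 0 < s <= 3")
    return 2 * exp(EULER_GAMMA) / s

def f_small(s):
    """Lower bound sieve function; closed form used only for 0 < s <= 4."""
    if not 0 < s <= 4:
        raise ValueError("closed form assumed only for 0 < s <= 4")
    if s <= 2:
        return 0.0  # the lower bound is trivial up to s = 2
    return 2 * exp(EULER_GAMMA) * log(s - 1) / s

if __name__ == "__main__":
    for s in (2.0, 2.5, 3.0):
        print(s, f_small(s), F_small(s))
```

The value `f_small(2.0)` is exactly zero, which matches the remark that the lower bound just fails to detect primes.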

Example 2. Let $0 < \theta < 1$, and choose

$$\mathcal{A}=\{n : x-x^{\theta}<n\le x\},$$

$X = x^{\theta}$, $\omega(d) = 1$, and



X e 

y = — , z^x^ 
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■5 



with 5 > 0, so that 



logy 29 
and $R_d = O(1)$. As in Example 1, it follows that

$$S(\mathcal{A},\mathbb{P};z)\gg\frac{X}{\log x}.$$

Hence, if $x$ is large enough, the interval $x - x^{\theta} < n \le x$ contains at
least one integer $n$ all of whose prime factors $p$ satisfy
$p \ge x^{\theta/2-\delta}$. In particular if $r \in \mathbb{N}$ and
$\theta > 2/(r+1)$, then we may choose $\delta > 0$ so that
$\theta/2 - \delta > 1/(r+1)$. Thus we will have $p > x^{1/(r+1)}$ so that $n$ can
have at most $r$ prime factors.

In general we say that a positive integer $n$ is an almost prime of type $P_r$,
if it has at most $r$ prime factors, counted according to multiplicity. We may
then conclude that if $\theta > 2/(r+1)$, and if $x$ is sufficiently large, then the
interval $x - x^{\theta} < n \le x$ contains at least one $P_r$ number. The necessary
size for $\theta$ can be reduced (see Example 1 after Theorem 6.2) and it is an
interesting problem to know just how small it may be taken.
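The counting argument in Example 2 can be checked by brute force for a small case. The following Python sketch is purely illustrative and proves nothing about large $x$: the parameter values ($x = 10^6$, $\theta = 0.5$, $\delta = 0.04$, hence $r = 4$) and the function names are assumptions chosen for the demonstration.

```python
def prime_factors(n):
    """Prime factors of n, with multiplicity, by trial division."""
    factors, p = [], 2
    while p * p <= n:
        while n % p == 0:
            factors.append(p)
            n //= p
        p += 1
    if n > 1:
        factors.append(n)
    return factors

def sifted_interval(x, theta, delta):
    """Integers n in (x - x^theta, x] whose prime factors are all
    at least x^(theta/2 - delta)."""
    z = x ** (theta / 2 - delta)
    start = x - int(x ** theta)
    return [n for n in range(start + 1, x + 1)
            if all(p >= z for p in prime_factors(n))]

if __name__ == "__main__":
    # theta = 0.5 > 2/(r+1) with r = 4, and theta/2 - delta = 0.21 > 1/5.
    survivors = sifted_interval(10**6, 0.5, 0.04)
    print(len(survivors), max(len(prime_factors(n)) for n in survivors))
```

Every survivor has all prime factors above $x^{1/5}$, so it can have at most four of them, which is the $P_4$ conclusion for this choice of $\theta$.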

Example 3 (The twin primes problem). Let

$$\mathcal{A}=\{p+2 : p\in\mathbb{P},\ p\le x\},$$

and take $X = \pi(x)$ and

$$\omega(p)=\begin{cases}\dfrac{p}{p-1}, & p\ne 2,\\[1ex] 0, & p=2.\end{cases}$$



In view of the Bombieri- Vinogradov Theorem, one can take y = x^^'^{log x) ^'^ 
to obtain 

^-^ logxr 

d<y V G y 

As above, one then concludes that

$$S(\mathcal{A},\mathbb{P};z)\gg\frac{x}{(\log x)^{2}}.$$

Since $s = (\log y)/(\log z)$ we may therefore use $z = x^{\theta}$ with any constant
exponent $\theta < 1/4$. It follows then that the sequence $\mathcal{A}$ contains a
growing number of $P_4$ integers as $x$ tends to infinity. As we shall see in
Theorem 6.3, this has been improved by Chen [Q] who shows that the same is true
for $P_2$ integers.
Example 4. Let $\mathcal{A} = \{n^2+1 : n \le x\}$, and take $X = x$ and

$$\omega(p)=\begin{cases}2, & p\equiv 1 \pmod 4,\\ 0, & p\equiv -1 \pmod 4,\\ 1, & p=2.\end{cases}$$

Then $R_d = O(\omega(d))$ and

^ V w ^ ' 

w<p<z 



If we take y = x{logx) ^ it follows that 



2 \ 1UMUW» timt 

d<y 



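Thus if $z = x^{\theta}$ with a constant exponent $\theta < 1/2$ we will find that
$s > 2$ and hence

$$S(\mathcal{A},\mathbb{P};z)\gg\frac{x}{\log x}.$$

It follows, on taking $\theta > 2/5$, that there are infinitely many $P_4$ numbers of
the form $n^2+1$. (Each such number is at most $x^2+1$ and has all its prime
factors at least $x^{\theta}$, so five prime factors would be too many.)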
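The values of $\omega(p)$ used in Example 4 count the solutions of $n^2 + 1 \equiv 0 \pmod p$. A short Python check, given here as an illustration with hypothetical function names, compares a brute-force count with the three cases $p \equiv 1 \pmod 4$, $p \equiv -1 \pmod 4$ and $p = 2$.

```python
def omega_bruteforce(p):
    """Number of residues n modulo p with n^2 + 1 divisible by p."""
    return sum(1 for n in range(p) if (n * n + 1) % p == 0)

def omega_formula(p):
    """The case distinction of Example 4, for a prime p."""
    if p == 2:
        return 1
    return 2 if p % 4 == 1 else 0

if __name__ == "__main__":
    for p in (2, 3, 5, 7, 11, 13, 17, 19, 23, 29):
        print(p, omega_bruteforce(p), omega_formula(p))
```

On average over primes the value of $\omega(p)$ is 1, which is why this is a sieve problem of dimension 1.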

In a similar way one can prove that if $f$ is an irreducible integer polynomial
of degree $k$ such that the values $\{f(n) \mid n \in \mathbb{N}\}$ contain no common
factor, then $f(n) = P_{2k}$ infinitely often. Here also improvements are possible.

We conclude this section with the following important result. 
The "Fundamental Lemma" Suppose that 



Then 

$$S(\mathcal{A},\mathcal{P};z)\sim XW(z),$$

if $\log z = o(\log y)$, that is, if $s \to \infty$.

Proof. It suffices to observe that $F(s) = 1 + O(e^{-s})$ and
$f(s) = 1 + O(e^{-s})$, as $s \to \infty$.
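The asymptotic formula is easy to observe when $z$ is very small compared with $x$. The sketch below is a numerical illustration under the simplest assumptions, $\mathcal{A} = \{n \le x\}$, $X = x$, $\omega(d) = 1$ and $\mathcal{P}$ the set of all primes. It is not a quantitative form of the lemma, and the function names are hypothetical.

```python
from math import prod

def primes_below(z):
    """Primes p < z by trial division."""
    return [p for p in range(2, z)
            if all(p % q for q in range(2, int(p ** 0.5) + 1))]

def sifted_count(x, z):
    """S(A, P; z) for A = {n <= x}: integers with no prime factor below z."""
    primes = primes_below(z)
    return sum(1 for n in range(1, x + 1) if all(n % p for p in primes))

def main_term(x, z):
    """X W(z) with X = x and omega(p) = 1."""
    return x * prod(1 - 1 / p for p in primes_below(z))

def ratio(x, z):
    return sifted_count(x, z) / main_term(x, z)

if __name__ == "__main__":
    print(ratio(10**6, 20))
```

For $x = 10^6$ and $z = 20$ there are only eight sieving primes, so inclusion-exclusion over the $2^8$ divisors shows that the count differs from $XW(z)$ by at most 256.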

Remarks. 

1) This result should be compared with Corollary 2.1, in which one re- 
quired z < logy. 

2) Since one can often choose $y = X^{\delta}$ for some $\delta > 0$ it follows from the
Fundamental Lemma that

$$S(\mathcal{A},\mathcal{P};z)\sim XW(z),$$

if $z = X^{\varepsilon(X)}$ with $\varepsilon(X) \to 0$ as $X \to \infty$. By considering the
sequences $\mathcal{A}^{\pm}$ one sees that one cannot have such a result when
$\varepsilon(X) \not\to 0$.

3) The significance of the Fundamental Lemma is that one can sieve out
"small primes" (namely those below $X^{\varepsilon(X)}$) as an initial stage in some more
complicated argument, and still have an asymptotic formula. In order to
make use of this information one usually wants a quantitative form of the
Fundamental Lemma, but this is easily established.

4) One can obtain analogous results on the weaker assumption that

as $z \to \infty$.

6 The weighted sieve

If the sequence $\mathcal{A}$ contains only positive integers $n \le N$, and we can show
that $S(\mathcal{A},\mathcal{P};N^{\theta}) > 0$ for some $\theta > 1/(r+1)$, then we can
conclude that $\mathcal{A}$ contains at least one $P_r$ number. However one can often
derive better results by using a weighted sieve, in which certain $P_r$ numbers
with prime factors below $N^{1/(r+1)}$ are also counted.

In general we let $N = \max_{n\in\mathcal{A}}|n|$ and we choose constants
$0 < \alpha < \beta$. We then set

$$W := W(\mathcal{A},\mathcal{P};\alpha,\beta)=\sum_{\substack{n\in\mathcal{A}\\ (n,\Pi(\mathcal{P},N^{\alpha}))=1}}\Big(1-\sum_{\substack{p\mid n\\ N^{\alpha}\le p<N^{\beta}}}w_p\Big),$$



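where the weights $w_p \ge 0$ are to be chosen so that

$$1-\sum_{\substack{p\mid n\\ N^{\alpha}\le p<N^{\beta}}}w_p\le 0\quad\text{for }\nu(n)>r. \tag{6.1}$$

This condition ensures that

$$W\le\sum_{\substack{n\in\mathcal{A},\ \nu(n)\le r\\ (n,\Pi(\mathcal{P},N^{\alpha}))=1}}\Big(1-\sum_{\substack{p\mid n\\ N^{\alpha}\le p<N^{\beta}}}w_p\Big)\le\sum_{n\in\mathcal{A},\ \nu(n)\le r}1.$$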
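If we know that

$$\#\{n\in\mathcal{A} : \exists\,p^2\mid n,\ N^{\alpha}\le p<N^{\beta}\}\ll\frac{X}{(\log N)^2}$$

we can deduce that

$$W\le\#\{n\in\mathcal{A} : n=P_r\}+O\Big(\frac{X}{(\log N)^2}\Big).$$

Thus if we can also show that

$$W\gg\frac{X}{\log N} \tag{6.2}$$

we will be able to deduce that

$$\#\{n\in\mathcal{A} : n=P_r\}\gg\frac{X}{\log N}.$$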
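The optimal choice for the weights $w_p$ is not known, however the choice

$$w_p=\frac{\beta}{(r+1)\beta-1}\Big(1-\frac{\log p}{\beta\log N}\Big)$$

leads to some fairly satisfactory results. These are known as Richert's
logarithmic weights.

We shall assume henceforth that

$$(r+1)\beta-1>0.$$

Then if $p < N^{\beta}$ we will have $w_p > 0$ as required. Now suppose that
$n \in \mathcal{A}$ with h.c.f.$(n,\Pi(\mathcal{P},N^{\alpha})) = 1$, and consider the sum

$$S=\sum_{\substack{p\mid n\\ N^{\alpha}\le p<N^{\beta}}}w_p.$$

Then

$$S=\frac{\beta}{(r+1)\beta-1}\sum_{\substack{p\mid n\\ N^{\alpha}\le p<N^{\beta}}}\Big(1-\frac{\log p}{\beta\log N}\Big)\ge\frac{\beta}{(r+1)\beta-1}\sum_{p\mid n}\Big(1-\frac{\log p}{\beta\log N}\Big),$$

since $n$ has no prime factors $p < N^{\alpha}$, and

$$1-\frac{\log p}{\beta\log N}\le 0$$

for any prime factor $p \ge N^{\beta}$. However

$$\sum_{p\mid n}\log p\le\log|n|\le\log N,$$

so that

$$\sum_{p\mid n}\Big(1-\frac{\log p}{\beta\log N}\Big)\ge\nu(n)-\frac1\beta.$$

Hence if $\nu(n) \ge r+1$ we will have

$$S\ge\frac{\beta}{(r+1)\beta-1}\Big(r+1-\frac1\beta\Big)=1,$$

as required for (6.1).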
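The inequality just derived can be tested by brute force. The Python sketch below is an illustration with hypothetical names and arbitrarily chosen parameters ($N = 20000$, $r = 2$, $\alpha = 0.1$, $\beta = 0.5$). It takes the weights to be $w_p = \frac{\beta}{(r+1)\beta-1}\big(1 - \frac{\log p}{\beta\log N}\big)$, sifts by all primes below $N^{\alpha}$, and looks for the smallest weight sum over integers $n \le N$ with at least $r+1$ distinct prime factors.

```python
from math import log

def distinct_prime_factors(n):
    """Distinct prime factors of n by trial division."""
    factors, p = [], 2
    while p * p <= n:
        if n % p == 0:
            factors.append(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        factors.append(n)
    return factors

def richert_weight(p, N, r, beta):
    """Logarithmic weight w_p, assuming (r + 1) * beta - 1 > 0."""
    return beta / ((r + 1) * beta - 1) * (1 - log(p) / (beta * log(N)))

def min_weight_sum(N, r, alpha, beta):
    """Smallest sum of w_p over primes p | n with N^alpha <= p < N^beta,
    taken over n <= N having no prime factor below N^alpha and at least
    r + 1 distinct prime factors. Returns None if no such n exists."""
    lo, hi = N ** alpha, N ** beta
    best = None
    for n in range(2, N + 1):
        ps = distinct_prime_factors(n)
        if len(ps) < r + 1 or min(ps) < lo:
            continue
        total = sum(richert_weight(p, N, r, beta) for p in ps if lo <= p < hi)
        best = total if best is None else min(best, total)
    return best

if __name__ == "__main__":
    print(min_weight_sum(20_000, 2, 0.1, 0.5))
```

The minimum should never drop below 1, which is exactly what condition (6.1) demands of the weights.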

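We now examine the estimate (6.2). By definition,

$$W=S(\mathcal{A},\mathcal{P};N^{\alpha})-\sum_{N^{\alpha}\le p<N^{\beta}}w_pS(\mathcal{A}_p,\mathcal{P};N^{\alpha}).$$

We plan to apply Corollary 5.1. We therefore make the assumption that

$$\sum_{d<N^{\gamma}}|R_d|\ll\frac{X}{(\log N)^2}$$

for some fixed $\gamma > 0$, and we set $y = N^{\gamma}$. According to Corollary 5.1 we will
then have

$$S(\mathcal{A},\mathcal{P};N^{\alpha})\ge XW(N^{\alpha})\Big(f\Big(\frac{\gamma}{\alpha}\Big)+o(1)\Big).$$

Moreover, since

$$\#\mathcal{A}_{pd}=X\frac{\omega(p)}{p}\frac{\omega(d)}{d}+R_{pd},$$

it follows that

$$S(\mathcal{A}_p,\mathcal{P};N^{\alpha})\le X\frac{\omega(p)}{p}W(N^{\alpha})\Big(F\Big(\frac{\log N^{\gamma}/p}{\log N^{\alpha}}\Big)+o(1)\Big)+\sum_{\substack{d<N^{\gamma}/p\\ d\mid\Pi(\mathcal{P},N^{\alpha})}}|R_{pd}|.$$

Moreover,

$$\sum_{N^{\alpha}\le p<N^{\beta}}\ \sum_{\substack{d<N^{\gamma}/p\\ d\mid\Pi(\mathcal{P},N^{\alpha})}}|R_{pd}|\ll\sum_{k<N^{\gamma}}|R_k|\ll\frac{X}{(\log N)^2},$$

since an integer $k < N^{\gamma}$ can have at most $\gamma/\alpha$ prime factors
$p \ge N^{\alpha}$. It follows that

$$\sum_{N^{\alpha}\le p<N^{\beta}}w_pS(\mathcal{A}_p,\mathcal{P};N^{\alpha})\le XW(N^{\alpha})\Big\{\sum_{N^{\alpha}\le p<N^{\beta}}w_p\frac{\omega(p)}{p}F\Big(\frac{\log N^{\gamma}/p}{\log N^{\alpha}}\Big)+o(1)\Big\}+O\Big(\frac{X}{(\log N)^2}\Big).$$

We therefore conclude that

$$W\ge XW(N^{\alpha})\Big\{f\Big(\frac{\gamma}{\alpha}\Big)-\sum_{N^{\alpha}\le p<N^{\beta}}w_p\frac{\omega(p)}{p}F\Big(\frac{\log N^{\gamma}/p}{\log N^{\alpha}}\Big)+o(1)\Big\}+O\Big(\frac{X}{(\log N)^2}\Big).$$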

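We summarize our conclusions as follows.

Theorem 6.1 Suppose that

$$\mathcal{A}\subseteq\mathbb{Z}\cap[-N,N]$$

and that

$$\#\mathcal{A}_d=X\frac{\omega(d)}{d}+R_d,$$

and assume that the following conditions hold.

(i) $\displaystyle\sum_{z\le p<w}\frac{\omega(p)\log p}{p}=\log w/z+O(1)$, $(2\le z\le w)$;

(ii) $\beta>\alpha>0$, $(r+1)\beta-1>0$;

(iii) $\displaystyle\#\{n\in\mathcal{A} : \exists\,p^2\mid n,\ N^{\alpha}\le p<N^{\beta}\}\ll\frac{X}{(\log N)^2}$;

(iv) $\displaystyle\sum_{d<N^{\gamma}}|R_d|\ll\frac{X}{(\log N)^2}$;

(v) $\displaystyle f\Big(\frac{\gamma}{\alpha}\Big)-\sum_{N^{\alpha}\le p<N^{\beta}}w_p\frac{\omega(p)}{p}F\Big(\frac{\log N^{\gamma}/p}{\log N^{\alpha}}\Big)\gg 1$,

where

$$w_p=\frac{\beta}{(r+1)\beta-1}\Big(1-\frac{\log p}{\beta\log N}\Big).$$

Then the sequence $\mathcal{A}$ contains $\gg X/\log N$ numbers of type $P_r$.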

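Our task now is to examine condition (v) in the above theorem. Let

$$S(t)=\sum_{N^{\alpha}\le p<t}\frac{\omega(p)\log p}{p}$$

and

$$h(t)=\frac{1}{\log t}\cdot\frac{\beta}{(r+1)\beta-1}\Big(1-\frac{\log t}{\beta\log N}\Big)F\Big(\frac{\log N^{\gamma}/t}{\log N^{\alpha}}\Big),$$

so that

$$\sum_{N^{\alpha}\le p<N^{\beta}}w_p\frac{\omega(p)}{p}F\Big(\frac{\log N^{\gamma}/p}{\log N^{\alpha}}\Big)=\sum_{N^{\alpha}\le p<N^{\beta}}\frac{\omega(p)\log p}{p}h(p).$$

By partial summation we find that

$$\sum_{N^{\alpha}\le p<N^{\beta}}\frac{\omega(p)\log p}{p}h(p)=-\int_{N^{\alpha}}^{N^{\beta}}S(t)h'(t)\,dt, \tag{6.3}$$

since $S(N^{\alpha}) = 0$ and $h(N^{\beta}) = 0$. According to assumption (i) we have
$S(t) = \log t - \alpha\log N + O(1)$. The contribution to (6.3) arising from the
error term is

$$\ll\int_{N^{\alpha}}^{N^{\beta}}|h'(t)|\,dt=\int_{\alpha}^{\beta}\Big|\frac{dh(N^{v})}{dv}\Big|\,dv.$$

However

$$h(N^{v})=\frac{1}{\log N}\cdot\frac{\beta}{(r+1)\beta-1}\Big(\frac1v-\frac1\beta\Big)F\Big(\frac{\gamma-v}{\alpha}\Big),$$

whence

$$\frac{dh(N^{v})}{dv}\ll\frac{1}{\log N}$$

uniformly for $\alpha \le v \le \beta$. We therefore conclude that (6.3) is

$$-\int_{N^{\alpha}}^{N^{\beta}}\{\log t-\alpha\log N+O(1)\}h'(t)\,dt=\frac{\beta}{(r+1)\beta-1}\int_{\alpha}^{\beta}\Big(\frac1v-\frac1\beta\Big)F\Big(\frac{\gamma-v}{\alpha}\Big)dv+O\Big(\frac{1}{\log N}\Big).$$

Hence the inequality

$$f\Big(\frac{\gamma}{\alpha}\Big)>\frac{\beta}{(r+1)\beta-1}\int_{\alpha}^{\beta}\Big(\frac1v-\frac1\beta\Big)F\Big(\frac{\gamma-v}{\alpha}\Big)dv \tag{6.4}$$

is necessary and sufficient for condition (v) of Theorem 6.1.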

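In order to express $f$ and $F$ in terms of elementary functions we shall
impose the condition $\gamma/4 \le \alpha \le \gamma/2$. For this range we will have

$$f\Big(\frac{\gamma}{\alpha}\Big)=\frac{2e^{\gamma_0}\log(\gamma/\alpha-1)}{\gamma/\alpha}$$

and

$$F\Big(\frac{\gamma-v}{\alpha}\Big)=\frac{2e^{\gamma_0}\alpha}{\gamma-v},$$

where we have written $\gamma_0$ for Euler's constant, to avoid confusion with the
parameter $\gamma$. Thus (6.4) is equivalent to the condition

$$\frac1\gamma\log\Big(\frac{\gamma}{\alpha}-1\Big)>\frac{\beta}{(r+1)\beta-1}\int_{\alpha}^{\beta}\Big(\frac1v-\frac1\beta\Big)\frac{dv}{\gamma-v}.$$

We can now perform the integration on the right hand side, to obtain

$$\log\Big(\frac{\gamma}{\alpha}-1\Big)>\frac{1}{(r+1)\beta-1}\Big\{\beta\log\frac{\beta}{\alpha}-(\gamma-\beta)\log\frac{\gamma-\alpha}{\gamma-\beta}\Big\}. \tag{6.5}$$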
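The integration step can be checked numerically. Substituting $F(s) = 2e^{\gamma_0}/s$ into (6.4) leaves the integral of $(1/v - 1/\beta)/(\gamma - v)$ over $[\alpha,\beta]$, and partial fractions give the closed form $\frac{1}{\gamma\beta}\{\beta\log\frac{\beta}{\alpha} - (\gamma-\beta)\log\frac{\gamma-\alpha}{\gamma-\beta}\}$. The Python sketch below compares the two. It is an illustration only: the midpoint rule, the sample parameters and the function names are assumptions.

```python
from math import log

def integral_numeric(alpha, beta, gamma, steps=100_000):
    """Midpoint rule for the integral over [alpha, beta] of
    (1/v - 1/beta) / (gamma - v), assuming 0 < alpha < beta < gamma."""
    h = (beta - alpha) / steps
    total = 0.0
    for k in range(steps):
        v = alpha + (k + 0.5) * h
        total += (1 / v - 1 / beta) / (gamma - v)
    return total * h

def integral_closed(alpha, beta, gamma):
    """Closed form of the same integral, obtained by partial fractions."""
    bracket = (beta * log(beta / alpha)
               - (gamma - beta) * log((gamma - alpha) / (gamma - beta)))
    return bracket / (gamma * beta)

if __name__ == "__main__":
    gamma, r = 0.5, 2
    alpha, beta = gamma / 4, gamma / (1 + 3.0 ** (-r))
    print(integral_numeric(alpha, beta, gamma), integral_closed(alpha, beta, gamma))
```

Multiplying the closed form by $\gamma\beta/((r+1)\beta - 1)$ gives the right hand side of (6.5).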
We shall choose $\alpha = \gamma/4$ and

$$\beta=\frac{\gamma}{1+3^{-r}}.$$

(These are in fact optimal, as a relatively easy calculation shows. However
we do not need to know that the choice is optimal to proceed.) The above
values are compatible with condition (ii) of Theorem 6.1 providing that

$$\gamma>\frac{1+3^{-r}}{r+1}. \tag{6.6}$$

Moreover (6.5) then reduces to

$$\gamma>\Big(r+1-\frac{\log 4/(1+3^{-r})}{\log 3}\Big)^{-1},$$

which is a stronger condition than (6.6).
We therefore have the following result. 
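The reduction of (6.5) can be sanity-checked numerically before it is stated as a theorem. The Python sketch below is an illustration with hypothetical names. It reads (6.5) as $\log(\gamma/\alpha - 1) > \frac{1}{(r+1)\beta-1}\{\beta\log\frac{\beta}{\alpha} - (\gamma-\beta)\log\frac{\gamma-\alpha}{\gamma-\beta}\}$, inserts $\alpha = \gamma/4$ and $\beta = \gamma/(1+3^{-r})$, and compares the sign of the difference with the sign of $\gamma\Lambda_r - 1$, where $\Lambda_r = r + 1 - \log(4/(1+3^{-r}))/\log 3$.

```python
from math import log

def Lambda(r):
    """Lambda_r = r + 1 - log(4 / (1 + 3^{-r})) / log 3."""
    return r + 1 - log(4 / (1 + 3.0 ** (-r))) / log(3)

def gap_65(gamma, r):
    """Left side minus right side of (6.5) for alpha = gamma / 4 and
    beta = gamma / (1 + 3^{-r}); requires (r + 1) * beta - 1 > 0."""
    alpha = gamma / 4
    beta = gamma / (1 + 3.0 ** (-r))
    if (r + 1) * beta - 1 <= 0:
        raise ValueError("condition (ii) fails for these parameters")
    bracket = (beta * log(beta / alpha)
               - (gamma - beta) * log((gamma - alpha) / (gamma - beta)))
    return log(gamma / alpha - 1) - bracket / ((r + 1) * beta - 1)

if __name__ == "__main__":
    for gamma in (0.5, 0.6):
        print(gamma, gap_65(gamma, 2), gamma * Lambda(2) - 1)
```

For $r = 2$ the two quantities change sign together as $\gamma$ passes $1/\Lambda_2 \approx 0.545$.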

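Theorem 6.2 Suppose the assumptions of Theorem 6.1 hold, with

$$\alpha=\gamma/4,\quad\text{and}\quad\beta=\frac{\gamma}{1+3^{-r}},$$

and with condition (v) replaced by

$$\gamma>\Lambda_r^{-1},$$

where

$$\Lambda_r := r+1-\frac{\log 4/(1+3^{-r})}{\log 3}.$$

Then the sequence $\mathcal{A}$ contains $\gg X/\log N$ numbers of type $P_r$.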
Remarks. 

1) For r > 2 we have r — | < A,, < r — |. In particular we have A2 > 

2) The only parameters which enter into the theorem in a crucial way are
$N$, which measures the size of elements of $\mathcal{A}$, and $\gamma$ which measures
the size of the remainders, in terms of $N$. The parameter $\gamma$ is often called the
"level of distribution" (or more precisely, since we may not know the optimal
value for $\gamma$, an "admissible level of distribution").
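The constants $\Lambda_r$ are easy to tabulate. The short Python sketch below is an illustration with a hypothetical function name. It evaluates $\Lambda_r = r + 1 - \log(4/(1+3^{-r}))/\log 3$ for small $r$, together with the threshold $1/\Lambda_r$ that the level of distribution $\gamma$ has to exceed in Theorem 6.2.

```python
from math import log

def Lambda(r):
    """Lambda_r = r + 1 - log(4 / (1 + 3^{-r})) / log 3."""
    return r + 1 - log(4 / (1 + 3.0 ** (-r))) / log(3)

if __name__ == "__main__":
    for r in range(1, 6):
        print(r, round(Lambda(r), 4), round(1 / Lambda(r), 4))
```

Since $\log(1+3^{-r})$ tends to 0, the difference $r - \Lambda_r$ increases towards $\log 4/\log 3 - 1 \approx 0.262$ as $r$ grows.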

Example 1. Let

$$\mathcal{A}=\{n\in\mathbb{N} : x-x^{\theta}<n\le x\},$$

and take $\mathcal{P} = \mathbb{P}$, $X = x^{\theta}$ and $\omega(p) = 1$. Then $R_d \ll 1$ so
that we may choose any $\gamma < \theta$. Then the assumptions of Theorem 6.2 hold true
providing that $\gamma > \Lambda_r^{-1}$. We therefore conclude that $\mathcal{A}$ contains a
$P_r$ almost-prime if $x$ is large enough, providing that $\theta > \Lambda_r^{-1}$.

For $r = 2$ much stronger results are known. According to work of Baker,
Harman and Pintz the sequence $\mathcal{A}$ actually contains a prime, for the
exponent $\theta = 0.525$, which is smaller than $\Lambda_2^{-1}$. Moreover Liu has shown
that there are $P_2$'s as soon as $\theta > 0.436$. It would be nice to know that
$\theta > 1/r$ sufficed to ensure the existence of $P_r$'s in $\mathcal{A}$, for every $r$.
Example 2. Let $N$ be an even integer and put

$$\mathcal{A}=\{N-p : p\in\mathbb{P},\ 3\le p\le N-3\},$$

$$\mathcal{P}=\{p\in\mathbb{P} : p\nmid N\},$$

$$\omega(p)=\begin{cases}\dfrac{p}{p-1}, & \text{if } p\nmid N,\\[1ex] 0, & \text{if } p\mid N,\end{cases}$$

and $X = \mathrm{Li}(N)$. As in our discussion of this example in §3, we find, via
the Bombieri-Vinogradov Theorem, that any $\gamma < 1/2$ will be admissible for
the remainder sum. Since $\Lambda_3 > 2$ this suffices to show that $\mathcal{A}$ contains a
$P_3$ for large enough $N$, so that every sufficiently large even integer $2n$ may be
written as a sum of a prime and a $P_3$ almost prime.

In this second example we see that $\Lambda_2$ is only just less than 2, so we come
quite close to handling $P_2$'s this way. However to achieve this requires an
ingenious new idea.

Chen's theorem. Every sufficiently large even integer $N$ is a sum of a
prime and a $P_2$ almost-prime.

More precisely, for every sufficiently large positive integer $N$ we have

$$\#\{p\in\mathbb{P} : N-p\in P_2\}\ge 0.335\,C_2\Big(\prod_{p\mid N,\ p\ne 2}\frac{p-1}{p-2}\Big)\frac{N}{(\log N)^2},$$

where

$$C_2=2\prod_{p>2}\Big(1-\frac{1}{(p-1)^2}\Big),$$

as in § 3.

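Sketch proof. Given an even positive integer $N$, let

$$\mathcal{A}=\{N-p : p\in\mathbb{P},\ 3\le p\le N-3\}.$$

Define $\mathcal{P}$ and $\omega(d)$ as in the previous example, and let $z > 2$. For
$n \in \mathbb{N}$, let $a(n) = 1$ if

$$n=p_1p_2p_3,\quad p_i\ \text{prime},\quad p_1<N^{1/3}\le p_2\le p_3, \tag{6.7}$$

and let $a(n) = 0$ otherwise. We then consider the sum

$$S_0=S(\mathcal{A},\mathcal{P};z)-\frac12\sum_{z\le p<N^{1/3}}S(\mathcal{A}_p,\mathcal{P};z)-\frac12\sum_{\substack{n\in\mathcal{A}\\ (n,\Pi(\mathcal{P},z))=1}}a(n).$$

The first two terms of this may be thought of as giving a weighted sieve, with
constant weights $w_p = 1/2$.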
Consider the contribution to $S_0$ arising from those values of $n$ which
are not square-free. If $z$ is a positive power of $N$ then it is easily shown that
this contribution is

$$O\Big(\frac{N}{(\log N)^3}\Big),$$

which will be negligible. (In proving this it is useful to note that if $p^2 \mid n$ and
$(n,\Pi(\mathcal{P},z)) = 1$, then $p \ge z$.)

Let $w(n)$ be the weight attached to $n$ in the expression $S_0$. Clearly we
have $w(n) \le 1$ for every $n$. We claim that $w(n) \le 0$ for any square-free
integer $n \in \mathcal{A}$, unless $n$ is a $P_2$. Subject to this assertion, we will then have

$$S_0\le\#\{n\in\mathcal{A} : n=P_2\}+O\Big(\frac{N}{(\log N)^3}\Big).$$

To verify the claim take a square-free integer $n \in \mathcal{A}$ with $w(n) > 0$. Then we
will have $(n,\Pi(\mathcal{P},z)) = 1$. Moreover there can be at most one prime factor
$p \mid n$ in the range $z \le p < N^{1/3}$, and clearly any integer $n \le N$ can have at
most two prime factors $p \ge N^{1/3}$. Thus if $n$ is not a $P_2$ almost-prime it must
be of the form (6.7), so that $a(n) = 1$. However it is clear that in this case
we have $w(n) = 0$. This establishes the claim.
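The claim about the weights can be confirmed by exhaustive search for a small value of $N$. The Python sketch below is an illustration only: it takes the weight to be $w(n) = 1 - \frac12\#\{p \mid n : z \le p < N^{1/3}\} - \frac12 a(n)$ with $a(n)$ as in (6.7), simplifies the sieving set to all primes below $z$, and runs over every square-free $n \le N$ rather than only over $n = N - p$. The function names and the values $N = 30000$, $z = 5$ are assumptions.

```python
def prime_factors(n):
    """Prime factors of n, with multiplicity, by trial division."""
    factors, p = [], 2
    while p * p <= n:
        while n % p == 0:
            factors.append(p)
            n //= p
        p += 1
    if n > 1:
        factors.append(n)
    return factors

def chen_weight(n, N, z):
    """w(n) = 1 - (1/2) #{p | n : z <= p < N^(1/3)} - (1/2) a(n),
    or None when n has a prime factor below z (such n is sifted out)."""
    ps = prime_factors(n)
    if any(p < z for p in ps):
        return None
    small = sum(1 for p in ps if p ** 3 < N)       # primes below N^(1/3)
    a = 1 if len(ps) == 3 and small == 1 else 0    # n of the form (6.7)
    return 1 - 0.5 * small - 0.5 * a

def exceptions(N, z):
    """Square-free n <= N with positive weight and more than two prime factors."""
    bad = []
    for n in range(1, N + 1):
        ps = prime_factors(n)
        if len(set(ps)) != len(ps):
            continue  # not square-free
        w = chen_weight(n, N, z)
        if w is not None and w > 0 and len(ps) > 2:
            bad.append(n)
    return bad

if __name__ == "__main__":
    print(exceptions(30_000, 5))
```

An empty list is the expected outcome: a positive weight forces at most one prime factor below $N^{1/3}$, and then the term $a(n)$ cancels the only remaining way of having three prime factors.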

The terms $S(\mathcal{A},\mathcal{P};z)$ and $S(\mathcal{A}_p,\mathcal{P};z)$ are estimated from
below and above respectively, just as in the standard weighted sieve. However it is
necessary to choose $z$ somewhat smaller than before, as $z = N^{1/10}$. As a result one
has to evaluate $f(5)$, for example, by numerical integration.

However the key new ingredient is the treatment of the sum

$$\sum_{\substack{n\in\mathcal{A}\\ (n,\Pi(\mathcal{P},z))=1}}a(n).$$

Hitherto the only information about $\mathcal{A}$ that we have used comes from the
estimate

$$\#\mathcal{A}_d=X\frac{\omega(d)}{d}+R_d. \tag{6.8}$$
However we now use the precise structure of A to re- write the sum above as 

eB} = S{B, P; + 0{N^/^), 

where 

$$\mathcal{B}=\{N-p_1p_2p_3 : p_1p_2p_3<N,\ z\le p_1<N^{1/3}\le p_2\le p_3\}.$$

Thus we change our attention to a quite different sequence. This device
has been called the "reversal of roles", or "Chen's twist". (One should note
however that although Chen's application of this idea is arguably the most
spectacular, the principle was independently discovered by Iwaniec, amongst
others.)

Although the set $\mathcal{B}$ looks complicated, it is, in fact, essentially as simple
as $\mathcal{A}$. An analogue of the Bombieri-Vinogradov Theorem can be established,
showing that if

a 

with a suitable value for X, then 

X 



E 



Rd\ < 



(logX)= 



for any fixed $\gamma < 1/2$.

The standard Selberg upper bound for S{B,F; N^) now allows one to 
complete the proof of a positive lower bound for Sq. 

Remark 

While the parity phenomenon gives a limitation to the power of sieve 
methods which are based purely on the relation (6.8), it is no longer relevant 
once one uses additional information. Thus the reversal of roles trick has the 
potential to circumvent the parity problem. 

Other applications of the reversal of roles trick. 

1) One can show via the circle method that there are infinitely many triples
of distinct primes $p_1, p_2, p_3$ which form an arithmetic progression, so that
$p_2 - p_1 = p_3 - p_2$. On the other hand it is an open problem whether or not
there are infinitely many 4-tuples of distinct primes in arithmetic progression.

However, one can combine the circle method with the sieve, and use the
reversal of roles trick to give infinitely many 4-tuples $p_1, p_2, p_3, n$ in arithmetic
progression with $n$ a $P_2$ almost-prime (see Heath-Brown p).

2) In Example 3 of §5 we applied the Rosser-Iwaniec lower bound sieve,
together with the Bombieri-Vinogradov Theorem, to the sequence

$$\mathcal{A}=\{p+2 : p\in\mathbb{P},\ p\le x\}.$$

This was enough to show that if $\theta < 1/4$ then

$$S(\mathcal{A},\mathbb{P};x^{\theta})\gg\frac{x}{(\log x)^2}. \tag{6.9}$$

In particular this shows that $\mathcal{A}$ contains $P_4$ numbers if we choose $\theta > 1/5$.

However by using the reversal of roles trick it is possible to show the
existence of an admissible constant $\theta > 1/4$ for which (6.9) still holds, thereby
showing that $\mathcal{A}$ contains $P_3$ numbers, without the need for a weighted sieve.